Dataset statistics
| Number of variables | 41 |
|---|---|
| Number of observations | 697117 |
| Missing cells | 10023004 |
| Missing cells (%) | 35.1% |
| Duplicate rows | 18841 |
| Duplicate rows (%) | 2.7% |
| Total size in memory | 1.2 GiB |
| Average record size in memory | 1.8 KiB |
Variable types
| Numeric | 1 |
|---|---|
| Categorical | 29 |
| Unsupported | 5 |
| Boolean | 6 |
Status has constant value "Active" | Constant |
| Dataset has 18841 (2.7%) duplicate rows | Duplicates |
Guid has a high cardinality: 18841 distinct values | High cardinality |
FullName has a high cardinality: 18451 distinct values | High cardinality |
FirstName has a high cardinality: 11655 distinct values | High cardinality |
Surname has a high cardinality: 10229 distinct values | High cardinality |
IdNumber has a high cardinality: 14913 distinct values | High cardinality |
AllergyType has a high cardinality: 99 distinct values | High cardinality |
EmergencyContactNumber has a high cardinality: 2672 distinct values | High cardinality |
EmergencyContactFullName has a high cardinality: 2906 distinct values | High cardinality |
AlternativePickupContactNumber has a high cardinality: 626 distinct values | High cardinality |
BirthDate has a high cardinality: 2019 distinct values | High cardinality |
StartDate has a high cardinality: 811 distinct values | High cardinality |
Franchisee.Guid has a high cardinality: 3639 distinct values | High cardinality |
Caregiver.FullName has a high cardinality: 17723 distinct values | High cardinality |
Caregiver.FirstName has a high cardinality: 9230 distinct values | High cardinality |
Caregiver.Surname has a high cardinality: 9578 distinct values | High cardinality |
Caregiver.IdNumber has a high cardinality: 15284 distinct values | High cardinality |
Caregiver.ContactNumber has a high cardinality: 11312 distinct values | High cardinality |
Caregiver.Guid has a high cardinality: 18103 distinct values | High cardinality |
AllergyType is highly imbalanced (52.8%) | Imbalance |
HasAllergy is highly imbalanced (96.4%) | Imbalance |
HasDisability is highly imbalanced (97.2%) | Imbalance |
EthnicGroup is highly imbalanced (82.5%) | Imbalance |
GrantType is highly imbalanced (66.0%) | Imbalance |
InactiveReason is highly imbalanced (83.4%) | Imbalance |
Caregiver.RelationshipType is highly imbalanced (66.5%) | Imbalance |
Caregiver.HighestEducationLevel is highly imbalanced (88.4%) | Imbalance |
IdNumber has 34632 (5.0%) missing values | Missing |
AllergyType has 657971 (94.4%) missing values | Missing |
DisabilityType has 697006 (> 99.9%) missing values | Missing |
HealthConditions has 697117 (100.0%) missing values | Missing |
EmergencyContactNumber has 579161 (83.1%) missing values | Missing |
EmergencyContactFullName has 578643 (83.0%) missing values | Missing |
EmergencyContactFirstName has 697117 (100.0%) missing values | Missing |
EmergencyContactSurname has 697117 (100.0%) missing values | Missing |
AlternativePickupFirstName has 697117 (100.0%) missing values | Missing |
AlternativePickupSurname has 697117 (100.0%) missing values | Missing |
AlternativePickupContactNumber has 658970 (94.5%) missing values | Missing |
BirthDate has 62345 (8.9%) missing values | Missing |
HasDisability has 346468 (49.7%) missing values | Missing |
Gender has 31968 (4.6%) missing values | Missing |
EthnicGroup has 211714 (30.4%) missing values | Missing |
HomeLanguage has 226033 (32.4%) missing values | Missing |
GrantType has 11655 (1.7%) missing values | Missing |
InactiveReason has 647833 (92.9%) missing values | Missing |
Caregiver.IdNumber has 90058 (12.9%) missing values | Missing |
Caregiver.ContactNumber has 227587 (32.6%) missing values | Missing |
Caregiver.RelationshipType has 278832 (40.0%) missing values | Missing |
Caregiver.HighestEducationLevel has 519591 (74.5%) missing values | Missing |
Caregiver.Language has 676212 (97.0%) missing values | Missing |
Unnamed: 0 is uniformly distributed | Uniform |
Guid is uniformly distributed | Uniform |
DisabilityType is uniformly distributed | Uniform |
HealthConditions is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
EmergencyContactFirstName is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
EmergencyContactSurname is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
AlternativePickupFirstName is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
AlternativePickupSurname is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2023-06-13 10:55:30.528237 |
|---|---|
| Analysis finished | 2023-06-13 10:56:19.285962 |
| Duration | 48.76 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ)
| Distinct | 18841 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9420 |
| Minimum | 0 |
|---|---|
| Maximum | 18840 |
| Zeros | 37 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 942 |
| Q1 | 4710 |
| median | 9420 |
| Q3 | 14130 |
| 95-th percentile | 17898 |
| Maximum | 18840 |
| Range | 18840 |
| Interquartile range (IQR) | 9420 |
Descriptive statistics
| Standard deviation | 5438.9321 |
|---|---|
| Coefficient of variation (CV) | 0.57738133 |
| Kurtosis | -1.2 |
| Mean | 9420 |
| Median Absolute Deviation (MAD) | 4710 |
| Skewness | 0 |
| Sum | 6.5668421 × 109 |
| Variance | 29581982 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37 | < 0.1% |
| 12549 | 37 | < 0.1% |
| 12565 | 37 | < 0.1% |
| 12564 | 37 | < 0.1% |
| 12563 | 37 | < 0.1% |
| 12562 | 37 | < 0.1% |
| 12561 | 37 | < 0.1% |
| 12560 | 37 | < 0.1% |
| 12559 | 37 | < 0.1% |
| 12558 | 37 | < 0.1% |
| Other values (18831) | 696747 |
| Value | Count | Frequency (%) |
| 0 | 37 | |
| 1 | 37 | |
| 2 | 37 | |
| 3 | 37 | |
| 4 | 37 | |
| 5 | 37 | |
| 6 | 37 | |
| 7 | 37 | |
| 8 | 37 | |
| 9 | 37 |
| Value | Count | Frequency (%) |
| 18840 | 37 | |
| 18839 | 37 | |
| 18838 | 37 | |
| 18837 | 37 | |
| 18836 | 37 | |
| 18835 | 37 | |
| 18834 | 37 | |
| 18833 | 37 | |
| 18832 | 37 | |
| 18831 | 37 |
Guid
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 18841 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.8 MiB |
| 0605e301-a345-ea11-833a-00155d326100 | 37 |
|---|---|
| f3461cb9-3d49-ec11-834d-00155d326100 | 37 |
| 19bf40c4-ce49-ec11-834d-00155d326100 | 37 |
| 4dc71855-ce49-ec11-834d-00155d326100 | 37 |
| eade4824-cc49-ec11-834d-00155d326100 | 37 |
| Other values (18836) |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Characters and Unicode
| Total characters | 25096212 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0605e301-a345-ea11-833a-00155d326100 |
|---|---|
| 2nd row | 5c1e6f1a-bc45-ea11-833a-00155d326100 |
| 3rd row | 5637445f-eb45-ea11-833a-00155d326100 |
| 4th row | 4da208b6-fa45-ea11-833a-00155d326100 |
| 5th row | cdb4a38c-4f46-ea11-833a-00155d326100 |
Common Values
| Value | Count | Frequency (%) |
| 0605e301-a345-ea11-833a-00155d326100 | 37 | < 0.1% |
| f3461cb9-3d49-ec11-834d-00155d326100 | 37 | < 0.1% |
| 19bf40c4-ce49-ec11-834d-00155d326100 | 37 | < 0.1% |
| 4dc71855-ce49-ec11-834d-00155d326100 | 37 | < 0.1% |
| eade4824-cc49-ec11-834d-00155d326100 | 37 | < 0.1% |
| f4176c59-cb49-ec11-834d-00155d326100 | 37 | < 0.1% |
| 2236f3d8-ca49-ec11-834d-00155d326100 | 37 | < 0.1% |
| 3b8a7756-ca49-ec11-834d-00155d326100 | 37 | < 0.1% |
| 6d0928d9-c949-ec11-834d-00155d326100 | 37 | < 0.1% |
| 804f9214-c749-ec11-834d-00155d326100 | 37 | < 0.1% |
| Other values (18831) | 696747 |
Length
| Value | Count | Frequency (%) |
| 0605e301-a345-ea11-833a-00155d326100 | 37 | < 0.1% |
| e53d020c-6b46-ea11-833a-00155d326100 | 37 | < 0.1% |
| cdb4a38c-4f46-ea11-833a-00155d326100 | 37 | < 0.1% |
| 2b427474-5046-ea11-833a-00155d326100 | 37 | < 0.1% |
| 52abcbd9-5046-ea11-833a-00155d326100 | 37 | < 0.1% |
| 6e17db16-5146-ea11-833a-00155d326100 | 37 | < 0.1% |
| 6aab0708-5246-ea11-833a-00155d326100 | 37 | < 0.1% |
| 63745080-5246-ea11-833a-00155d326100 | 37 | < 0.1% |
| 941ae956-5446-ea11-833a-00155d326100 | 37 | < 0.1% |
| 53a902d7-5446-ea11-833a-00155d326100 | 37 | < 0.1% |
| Other values (18831) | 696747 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3391161 | |
| 1 | 3348463 | |
| - | 2788468 | |
| 5 | 2148405 | |
| 3 | 1962702 | 7.8% |
| 8 | 1320789 | 5.3% |
| 6 | 1313611 | 5.2% |
| d | 1288784 | 5.1% |
| e | 1198763 | 4.8% |
| 2 | 1198319 | 4.8% |
| Other values (7) | 5136747 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16944742 | |
| Lowercase Letter | 5363002 | 21.4% |
| Dash Punctuation | 2788468 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3391161 | |
| 1 | 3348463 | |
| 5 | 2148405 | |
| 3 | 1962702 | |
| 8 | 1320789 | 7.8% |
| 6 | 1313611 | 7.8% |
| 2 | 1198319 | 7.1% |
| 4 | 968808 | 5.7% |
| 9 | 714248 | 4.2% |
| 7 | 578236 | 3.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1288784 | |
| e | 1198763 | |
| b | 878898 | |
| c | 856624 | |
| a | 646612 | |
| f | 493321 | 9.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2788468 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19733210 | |
| Latin | 5363002 | 21.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3391161 | |
| 1 | 3348463 | |
| - | 2788468 | |
| 5 | 2148405 | |
| 3 | 1962702 | |
| 8 | 1320789 | 6.7% |
| 6 | 1313611 | 6.7% |
| 2 | 1198319 | 6.1% |
| 4 | 968808 | 4.9% |
| 9 | 714248 | 3.6% |
Latin
| Value | Count | Frequency (%) |
| d | 1288784 | |
| e | 1198763 | |
| b | 878898 | |
| c | 856624 | |
| a | 646612 | |
| f | 493321 | 9.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25096212 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3391161 | |
| 1 | 3348463 | |
| - | 2788468 | |
| 5 | 2148405 | |
| 3 | 1962702 | 7.8% |
| 8 | 1320789 | 5.3% |
| 6 | 1313611 | 5.2% |
| d | 1288784 | 5.1% |
| e | 1198763 | 4.8% |
| 2 | 1198319 | 4.8% |
| Other values (7) | 5136747 |
FullName
Categorical
| Distinct | 18451 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.9 MiB |
| Thabo Macala | 333 |
|---|---|
| Siyabonga Khumalo | 148 |
| Enzokuhle Ngcobo | 148 |
| Asande Ndlovu | 148 |
| Hlompo Gaosekwe | 148 |
| Other values (18446) |
Length
| Max length | 61 |
|---|---|
| Median length | 46 |
| Mean length | 18.012101 |
| Min length | 3 |
Characters and Unicode
| Total characters | 12556542 |
|---|---|
| Distinct characters | 84 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mxolisi komani |
|---|---|
| 2nd row | Thateho Ramohlabi |
| 3rd row | Shenaaze van wyk |
| 4th row | Leatitia Zona |
| 5th row | Avandro Pieter Klaaste |
Common Values
| Value | Count | Frequency (%) |
| Thabo Macala | 333 | < 0.1% |
| Siyabonga Khumalo | 148 | < 0.1% |
| Enzokuhle Ngcobo | 148 | < 0.1% |
| Asande Ndlovu | 148 | < 0.1% |
| Hlompo Gaosekwe | 148 | < 0.1% |
| Sukoluhle Ngubane | 148 | < 0.1% |
| Lethokuhle Lethokuhle | 111 | < 0.1% |
| Andani Netshivhale | 111 | < 0.1% |
| Asenathi Cele | 111 | < 0.1% |
| Mpho Spandiel | 111 | < 0.1% |
| Other values (18441) | 695600 |
Length
| Value | Count | Frequency (%) |
| dlamini | 6697 | 0.4% |
| junior | 6549 | 0.4% |
| enzokuhle | 6512 | 0.4% |
| blessing | 6031 | 0.4% |
| melokuhle | 5661 | 0.3% |
| lethabo | 5328 | 0.3% |
| ndlovu | 5106 | 0.3% |
| ngubane | 4921 | 0.3% |
| lubanzi | 4588 | 0.3% |
| sithole | 4255 | 0.3% |
| Other values (16591) | 1581047 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1357419 | 10.8% |
| e | 1171494 | 9.3% |
| 944980 | 7.5% | |
| o | 897805 | 7.2% |
| i | 802789 | 6.4% |
| l | 796129 | 6.3% |
| n | 780626 | 6.2% |
| h | 606948 | 4.8% |
| u | 436045 | 3.5% |
| s | 403189 | 3.2% |
| Other values (74) | 4359118 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9926138 | |
| Uppercase Letter | 1673880 | 13.3% |
| Space Separator | 944980 | 7.5% |
| Dash Punctuation | 4329 | < 0.1% |
| Decimal Number | 3885 | < 0.1% |
| Other Punctuation | 2516 | < 0.1% |
| Control | 444 | < 0.1% |
| Modifier Symbol | 185 | < 0.1% |
| Connector Punctuation | 74 | < 0.1% |
| Open Punctuation | 37 | < 0.1% |
| Other values (2) | 74 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1357419 | |
| e | 1171494 | |
| o | 897805 | 9.0% |
| i | 802789 | 8.1% |
| l | 796129 | 8.0% |
| n | 780626 | 7.9% |
| h | 606948 | 6.1% |
| u | 436045 | 4.4% |
| s | 403189 | 4.1% |
| t | 402190 | 4.1% |
| Other values (23) | 2271504 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 318570 | |
| S | 157953 | 9.4% |
| N | 151071 | 9.0% |
| L | 136530 | 8.2% |
| A | 125615 | 7.5% |
| K | 104377 | 6.2% |
| T | 90872 | 5.4% |
| B | 75776 | 4.5% |
| O | 67118 | 4.0% |
| P | 48137 | 2.9% |
| Other values (17) | 397861 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 999 | |
| 0 | 962 | |
| 8 | 407 | |
| 7 | 370 | 9.5% |
| 2 | 370 | 9.5% |
| 9 | 259 | 6.7% |
| 3 | 222 | 5.7% |
| 5 | 148 | 3.8% |
| 4 | 111 | 2.9% |
| 6 | 37 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 962 | |
| ' | 777 | |
| , | 444 | |
| / | 296 | 11.8% |
| ? | 37 | 1.5% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 148 | |
| 🏾 | 37 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| 944980 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4329 |
Control
| Value | Count | Frequency (%) |
| 444 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 74 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 37 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37 |
Other Symbol
| Value | Count | Frequency (%) |
| 👋 | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11600018 | |
| Common | 956524 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1357419 | 11.7% |
| e | 1171494 | 10.1% |
| o | 897805 | 7.7% |
| i | 802789 | 6.9% |
| l | 796129 | 6.9% |
| n | 780626 | 6.7% |
| h | 606948 | 5.2% |
| u | 436045 | 3.8% |
| s | 403189 | 3.5% |
| t | 402190 | 3.5% |
| Other values (50) | 3945384 |
Common
| Value | Count | Frequency (%) |
| 944980 | ||
| - | 4329 | 0.5% |
| 1 | 999 | 0.1% |
| . | 962 | 0.1% |
| 0 | 962 | 0.1% |
| ' | 777 | 0.1% |
| 444 | < 0.1% | |
| , | 444 | < 0.1% |
| 8 | 407 | < 0.1% |
| 7 | 370 | < 0.1% |
| Other values (14) | 1850 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12555913 | |
| None | 629 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1357419 | 10.8% |
| e | 1171494 | 9.3% |
| 944980 | 7.5% | |
| o | 897805 | 7.2% |
| i | 802789 | 6.4% |
| l | 796129 | 6.3% |
| n | 780626 | 6.2% |
| h | 606948 | 4.8% |
| u | 436045 | 3.5% |
| s | 403189 | 3.2% |
| Other values (64) | 4358489 |
None
| Value | Count | Frequency (%) |
| é | 148 | |
| è | 111 | |
| à | 74 | |
| ë | 74 | |
| ç | 37 | 5.9% |
| Ñ | 37 | 5.9% |
| ķ | 37 | 5.9% |
| ñ | 37 | 5.9% |
| 👋 | 37 | 5.9% |
| 🏾 | 37 | 5.9% |
FirstName
Categorical
| Distinct | 11655 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.8 MiB |
| Melokuhle | 2923 |
|---|---|
| Enzokuhle | 2886 |
| Lethabo | 2405 |
| Lesedi | 1850 |
| Mpho | 1850 |
| Other values (11650) |
Length
| Max length | 46 |
|---|---|
| Median length | 33 |
| Mean length | 10.366966 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7226988 |
|---|---|
| Distinct characters | 80 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mxolisi |
|---|---|
| 2nd row | Thateho |
| 3rd row | Shenaaze |
| 4th row | Leatitia |
| 5th row | Avandro Pieter |
Common Values
| Value | Count | Frequency (%) |
| Melokuhle | 2923 | 0.4% |
| Enzokuhle | 2886 | 0.4% |
| Lethabo | 2405 | 0.3% |
| Lesedi | 1850 | 0.3% |
| Mpho | 1850 | 0.3% |
| Karabo | 1850 | 0.3% |
| Omphile | 1702 | 0.2% |
| Rethabile | 1628 | 0.2% |
| Bokamoso | 1591 | 0.2% |
| Alunamda | 1554 | 0.2% |
| Other values (11645) | 676878 |
Length
| Value | Count | Frequency (%) |
| enzokuhle | 6253 | 0.7% |
| junior | 5883 | 0.6% |
| melokuhle | 5402 | 0.6% |
| blessing | 5328 | 0.6% |
| lethabo | 5143 | 0.6% |
| lubanzi | 4292 | 0.5% |
| karabo | 3848 | 0.4% |
| lesedi | 3478 | 0.4% |
| lethokuhle | 3404 | 0.4% |
| omphile | 3293 | 0.4% |
| Other values (8553) | 881451 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 741036 | 10.3% |
| a | 680689 | 9.4% |
| o | 549894 | 7.6% |
| l | 516483 | 7.1% |
| i | 495356 | 6.9% |
| n | 480223 | 6.6% |
| h | 401820 | 5.6% |
| 347208 | 4.8% | |
| u | 258038 | 3.6% |
| t | 246827 | 3.4% |
| Other values (70) | 2509414 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5926216 | |
| Uppercase Letter | 944647 | 13.1% |
| Space Separator | 347208 | 4.8% |
| Dash Punctuation | 3811 | 0.1% |
| Decimal Number | 2923 | < 0.1% |
| Other Punctuation | 1665 | < 0.1% |
| Control | 259 | < 0.1% |
| Connector Punctuation | 74 | < 0.1% |
| Modifier Symbol | 74 | < 0.1% |
| Other Symbol | 37 | < 0.1% |
| Other values (2) | 74 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 741036 | |
| a | 680689 | |
| o | 549894 | |
| l | 516483 | 8.7% |
| i | 495356 | 8.4% |
| n | 480223 | 8.1% |
| h | 401820 | 6.8% |
| u | 258038 | 4.4% |
| t | 246827 | 4.2% |
| s | 243867 | 4.1% |
| Other values (21) | 1311983 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 110260 | |
| A | 107559 | |
| S | 90576 | |
| M | 79439 | 8.4% |
| K | 67710 | 7.2% |
| N | 66156 | 7.0% |
| T | 59385 | 6.3% |
| O | 58386 | 6.2% |
| B | 48137 | 5.1% |
| E | 33744 | 3.6% |
| Other values (17) | 223295 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 851 | |
| 0 | 666 | |
| 7 | 333 | 11.4% |
| 8 | 259 | 8.9% |
| 9 | 222 | 7.6% |
| 2 | 185 | 6.3% |
| 3 | 148 | 5.1% |
| 4 | 111 | 3.8% |
| 5 | 111 | 3.8% |
| 6 | 37 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 740 | |
| ' | 592 | |
| , | 333 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 37 | |
| 🏾 | 37 |
Space Separator
| Value | Count | Frequency (%) |
| 347208 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3811 |
Control
| Value | Count | Frequency (%) |
| 259 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 74 |
Other Symbol
| Value | Count | Frequency (%) |
| 👋 | 37 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6870863 | |
| Common | 356125 | 4.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 741036 | 10.8% |
| a | 680689 | 9.9% |
| o | 549894 | 8.0% |
| l | 516483 | 7.5% |
| i | 495356 | 7.2% |
| n | 480223 | 7.0% |
| h | 401820 | 5.8% |
| u | 258038 | 3.8% |
| t | 246827 | 3.6% |
| s | 243867 | 3.5% |
| Other values (48) | 2256630 |
Common
| Value | Count | Frequency (%) |
| 347208 | ||
| - | 3811 | 1.1% |
| 1 | 851 | 0.2% |
| . | 740 | 0.2% |
| 0 | 666 | 0.2% |
| ' | 592 | 0.2% |
| , | 333 | 0.1% |
| 7 | 333 | 0.1% |
| 259 | 0.1% | |
| 8 | 259 | 0.1% |
| Other values (12) | 1073 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7226507 | |
| None | 481 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 741036 | 10.3% |
| a | 680689 | 9.4% |
| o | 549894 | 7.6% |
| l | 516483 | 7.1% |
| i | 495356 | 6.9% |
| n | 480223 | 6.6% |
| h | 401820 | 5.6% |
| 347208 | 4.8% | |
| u | 258038 | 3.6% |
| t | 246827 | 3.4% |
| Other values (62) | 2508933 |
None
| Value | Count | Frequency (%) |
| é | 148 | |
| è | 111 | |
| 👋 | 37 | 7.7% |
| ķ | 37 | 7.7% |
| à | 37 | 7.7% |
| ë | 37 | 7.7% |
| Ñ | 37 | 7.7% |
| 🏾 | 37 | 7.7% |
Surname
Categorical
| Distinct | 10229 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 370 |
| Missing (%) | 0.1% |
| Memory size | 42.5 MiB |
| Dlamini | 5587 |
|---|---|
| Ndlovu | 3663 |
| Mahlangu | 2923 |
| Sithole | 2812 |
| Ngubane | 2479 |
| Other values (10224) |
Length
| Max length | 30 |
|---|---|
| Median length | 28 |
| Mean length | 6.8755244 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4790501 |
|---|---|
| Distinct characters | 72 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | komani |
|---|---|
| 2nd row | Ramohlabi |
| 3rd row | van wyk |
| 4th row | Zona |
| 5th row | Klaaste |
Common Values
| Value | Count | Frequency (%) |
| Dlamini | 5587 | 0.8% |
| Ndlovu | 3663 | 0.5% |
| Mahlangu | 2923 | 0.4% |
| Sithole | 2812 | 0.4% |
| Ngubane | 2479 | 0.4% |
| Khumalo | 2183 | 0.3% |
| Ngubane | 2072 | 0.3% |
| Mokoena | 2072 | 0.3% |
| Mbatha | 2035 | 0.3% |
| Mkhize | 1961 | 0.3% |
| Other values (10219) | 668960 |
Length
| Value | Count | Frequency (%) |
| dlamini | 6105 | 0.9% |
| ngubane | 4699 | 0.7% |
| ndlovu | 4699 | 0.7% |
| sithole | 4033 | 0.6% |
| mkhize | 3441 | 0.5% |
| mahlangu | 3367 | 0.5% |
| khumalo | 3071 | 0.4% |
| mbatha | 2923 | 0.4% |
| dladla | 2368 | 0.3% |
| mokoena | 2331 | 0.3% |
| Other values (9604) | 671513 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 676730 | |
| e | 430458 | 9.0% |
| o | 347911 | 7.3% |
| i | 307433 | 6.4% |
| n | 300403 | 6.3% |
| l | 279646 | 5.8% |
| M | 239131 | 5.0% |
| h | 205128 | 4.3% |
| u | 178007 | 3.7% |
| s | 159322 | 3.3% |
| Other values (62) | 1666332 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3999922 | |
| Uppercase Letter | 728493 | 15.2% |
| Space Separator | 59755 | 1.2% |
| Decimal Number | 962 | < 0.1% |
| Other Punctuation | 555 | < 0.1% |
| Dash Punctuation | 518 | < 0.1% |
| Control | 185 | < 0.1% |
| Modifier Symbol | 111 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 676730 | |
| e | 430458 | |
| o | 347911 | 8.7% |
| i | 307433 | 7.7% |
| n | 300403 | 7.5% |
| l | 279646 | 7.0% |
| h | 205128 | 5.1% |
| u | 178007 | 4.5% |
| s | 159322 | 4.0% |
| t | 155363 | 3.9% |
| Other values (20) | 959521 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 239131 | |
| N | 84545 | 11.6% |
| S | 67377 | 9.2% |
| K | 36667 | 5.0% |
| T | 31487 | 4.3% |
| D | 30784 | 4.2% |
| B | 27639 | 3.8% |
| L | 26270 | 3.6% |
| P | 20239 | 2.8% |
| G | 18611 | 2.6% |
| Other values (16) | 145743 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 296 | |
| 2 | 185 | |
| 8 | 148 | |
| 1 | 148 | |
| 3 | 74 | 7.7% |
| 9 | 37 | 3.8% |
| 5 | 37 | 3.8% |
| 7 | 37 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 222 | |
| ' | 185 | |
| , | 111 | |
| ? | 37 | 6.7% |
Space Separator
| Value | Count | Frequency (%) |
| 59755 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 518 |
Control
| Value | Count | Frequency (%) |
| 185 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 111 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4728415 | |
| Common | 62086 | 1.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 676730 | |
| e | 430458 | 9.1% |
| o | 347911 | 7.4% |
| i | 307433 | 6.5% |
| n | 300403 | 6.4% |
| l | 279646 | 5.9% |
| M | 239131 | 5.1% |
| h | 205128 | 4.3% |
| u | 178007 | 3.8% |
| s | 159322 | 3.4% |
| Other values (46) | 1604246 |
Common
| Value | Count | Frequency (%) |
| 59755 | ||
| - | 518 | 0.8% |
| 0 | 296 | 0.5% |
| . | 222 | 0.4% |
| 2 | 185 | 0.3% |
| ' | 185 | 0.3% |
| 185 | 0.3% | |
| 8 | 148 | 0.2% |
| 1 | 148 | 0.2% |
| , | 111 | 0.2% |
| Other values (6) | 333 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4790353 | |
| None | 148 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 676730 | |
| e | 430458 | 9.0% |
| o | 347911 | 7.3% |
| i | 307433 | 6.4% |
| n | 300403 | 6.3% |
| l | 279646 | 5.8% |
| M | 239131 | 5.0% |
| h | 205128 | 4.3% |
| u | 178007 | 3.7% |
| s | 159322 | 3.3% |
| Other values (58) | 1666184 |
None
| Value | Count | Frequency (%) |
| ç | 37 | |
| ë | 37 | |
| à | 37 | |
| ñ | 37 |
IdNumber
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 14913 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 34632 |
| Missing (%) | 5.0% |
| Memory size | 45.2 MiB |
| 0000000000012 | |
|---|---|
| 0000000000000 | 2812 |
| 000000000012 | 1813 |
| 000 | 1591 |
| 0000 | 1480 |
| Other values (14908) |
Length
| Max length | 20 |
|---|---|
| Median length | 13 |
| Mean length | 12.816867 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8490982 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0000000000012 |
|---|---|
| 2nd row | 1807095666084 |
| 3rd row | 0000000000012 |
| 4th row | 0000000000012 |
| 5th row | 1806226123086 |
Common Values
| Value | Count | Frequency (%) |
| 0000000000012 | 88356 | 12.7% |
| 0000000000000 | 2812 | 0.4% |
| 000000000012 | 1813 | 0.3% |
| 000 | 1591 | 0.2% |
| 0000 | 1480 | 0.2% |
| 000000000000 | 925 | 0.1% |
| 0000000000 | 888 | 0.1% |
| 00000000000 | 444 | 0.1% |
| 0000000000123 | 296 | < 0.1% |
| 0000000000001 | 185 | < 0.1% |
| Other values (14903) | 563695 | |
| (Missing) | 34632 | 5.0% |
Length
| Value | Count | Frequency (%) |
| 0000000000012 | 88356 | 13.3% |
| 0000000000000 | 2812 | 0.4% |
| 000000000012 | 1813 | 0.3% |
| 000 | 1591 | 0.2% |
| 0000 | 1480 | 0.2% |
| 000000000000 | 925 | 0.1% |
| 0000000000 | 888 | 0.1% |
| 00000000000 | 444 | 0.1% |
| 0000000000123 | 296 | < 0.1% |
| none | 259 | < 0.1% |
| Other values (14907) | 563806 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2755390 | |
| 1 | 1419098 | |
| 8 | 1059421 | 12.5% |
| 2 | 699781 | 8.2% |
| 5 | 496688 | 5.8% |
| 7 | 456099 | 5.4% |
| 6 | 450734 | 5.3% |
| 9 | 433011 | 5.1% |
| 3 | 368964 | 4.3% |
| 4 | 332926 | 3.9% |
| Other values (24) | 18870 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8472112 | |
| Other Punctuation | 13801 | 0.2% |
| Dash Punctuation | 2553 | < 0.1% |
| Lowercase Letter | 1147 | < 0.1% |
| Uppercase Letter | 1073 | < 0.1% |
| Space Separator | 259 | < 0.1% |
| Modifier Symbol | 37 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 370 | |
| B | 148 | 13.8% |
| A | 148 | 13.8% |
| M | 111 | 10.3% |
| F | 74 | 6.9% |
| S | 37 | 3.4% |
| R | 37 | 3.4% |
| C | 37 | 3.4% |
| D | 37 | 3.4% |
| J | 37 | 3.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2755390 | |
| 1 | 1419098 | |
| 8 | 1059421 | 12.5% |
| 2 | 699781 | 8.3% |
| 5 | 496688 | 5.9% |
| 7 | 456099 | 5.4% |
| 6 | 450734 | 5.3% |
| 9 | 433011 | 5.1% |
| 3 | 368964 | 4.4% |
| 4 | 332926 | 3.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 333 | |
| o | 333 | |
| e | 296 | |
| h | 37 | 3.2% |
| b | 37 | 3.2% |
| z | 37 | 3.2% |
| w | 37 | 3.2% |
| v | 37 | 3.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 13764 | |
| . | 37 | 0.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2553 |
Space Separator
| Value | Count | Frequency (%) |
| 259 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8488762 | |
| Latin | 2220 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 370 | |
| n | 333 | |
| o | 333 | |
| e | 296 | |
| B | 148 | 6.7% |
| A | 148 | 6.7% |
| M | 111 | 5.0% |
| F | 74 | 3.3% |
| S | 37 | 1.7% |
| R | 37 | 1.7% |
| Other values (9) | 333 |
Common
| Value | Count | Frequency (%) |
| 0 | 2755390 | |
| 1 | 1419098 | |
| 8 | 1059421 | 12.5% |
| 2 | 699781 | 8.2% |
| 5 | 496688 | 5.9% |
| 7 | 456099 | 5.4% |
| 6 | 450734 | 5.3% |
| 9 | 433011 | 5.1% |
| 3 | 368964 | 4.3% |
| 4 | 332926 | 3.9% |
| Other values (5) | 16650 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8490982 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2755390 | |
| 1 | 1419098 | |
| 8 | 1059421 | 12.5% |
| 2 | 699781 | 8.2% |
| 5 | 496688 | 5.8% |
| 7 | 456099 | 5.4% |
| 6 | 450734 | 5.3% |
| 9 | 433011 | 5.1% |
| 3 | 368964 | 4.3% |
| 4 | 332926 | 3.9% |
| Other values (24) | 18870 | 0.2% |
AllergyType
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 657971 |
| Missing (%) | 94.4% |
| Memory size | 22.4 MiB |
| None | |
|---|---|
| none | |
| no | |
| No | |
| None | 814 |
| Other values (94) |
Length
| Max length | 62 |
|---|---|
| Median length | 4 |
| Mean length | 4.210775 |
| Min length | 2 |
Characters and Unicode
| Total characters | 164835 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | She must not be exposed to the sun as her nose starts bleeding |
|---|---|
| 2nd row | None |
| 3rd row | None |
| 4th row | None |
| 5th row | No |
Common Values
| Value | Count | Frequency (%) |
| None | 11803 | 1.7% |
| none | 9176 | 1.3% |
| no | 7252 | 1.0% |
| No | 5069 | 0.7% |
| None | 814 | 0.1% |
| None listed | 555 | 0.1% |
| NONE | 259 | < 0.1% |
| None Listed | 222 | < 0.1% |
| Eczema | 148 | < 0.1% |
| NO | 111 | < 0.1% |
| Other values (89) | 3737 | 0.5% |
| (Missing) | 657971 |
Length
| Value | Count | Frequency (%) |
| none | 23051 | |
| no | 12580 | |
| listed | 962 | 2.2% |
| to | 296 | 0.7% |
| eczema | 222 | 0.5% |
| tin | 185 | 0.4% |
| and | 185 | 0.4% |
| sinus | 185 | 0.4% |
| allergies | 185 | 0.4% |
| fish | 185 | 0.4% |
| Other values (109) | 5402 | 12.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 41773 | |
| o | 36926 | |
| e | 27898 | |
| N | 19536 | |
| 5587 | 3.4% | |
| s | 4107 | 2.5% |
| i | 3589 | 2.2% |
| t | 3145 | 1.9% |
| a | 2997 | 1.8% |
| l | 2442 | 1.5% |
| Other values (47) | 16835 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 134458 | |
| Uppercase Letter | 23865 | 14.5% |
| Space Separator | 5587 | 3.4% |
| Other Punctuation | 370 | 0.2% |
| Decimal Number | 333 | 0.2% |
| Control | 148 | 0.1% |
| Open Punctuation | 37 | < 0.1% |
| Close Punctuation | 37 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 41773 | |
| o | 36926 | |
| e | 27898 | |
| s | 4107 | 3.1% |
| i | 3589 | 2.7% |
| t | 3145 | 2.3% |
| a | 2997 | 2.2% |
| l | 2442 | 1.8% |
| d | 1776 | 1.3% |
| r | 1665 | 1.2% |
| Other values (15) | 8140 | 6.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 19536 | |
| S | 629 | 2.6% |
| E | 592 | 2.5% |
| A | 444 | 1.9% |
| O | 407 | 1.7% |
| B | 333 | 1.4% |
| L | 296 | 1.2% |
| P | 259 | 1.1% |
| C | 222 | 0.9% |
| D | 185 | 0.8% |
| Other values (10) | 962 | 4.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 148 | |
| 0 | 74 | |
| 6 | 37 | 11.1% |
| 5 | 37 | 11.1% |
| 9 | 37 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 185 | |
| / | 148 | |
| & | 37 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| 5587 |
Control
| Value | Count | Frequency (%) |
| 148 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 37 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 158323 | |
| Common | 6512 | 4.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 41773 | |
| o | 36926 | |
| e | 27898 | |
| N | 19536 | |
| s | 4107 | 2.6% |
| i | 3589 | 2.3% |
| t | 3145 | 2.0% |
| a | 2997 | 1.9% |
| l | 2442 | 1.5% |
| d | 1776 | 1.1% |
| Other values (35) | 14134 | 8.9% |
Common
| Value | Count | Frequency (%) |
| 5587 | ||
| , | 185 | 2.8% |
| 148 | 2.3% | |
| 2 | 148 | 2.3% |
| / | 148 | 2.3% |
| 0 | 74 | 1.1% |
| ( | 37 | 0.6% |
| ) | 37 | 0.6% |
| 6 | 37 | 0.6% |
| 5 | 37 | 0.6% |
| Other values (2) | 74 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 164835 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 41773 | |
| o | 36926 | |
| e | 27898 | |
| N | 19536 | |
| 5587 | 3.4% | |
| s | 4107 | 2.5% |
| i | 3589 | 2.2% |
| t | 3145 | 1.9% |
| a | 2997 | 1.8% |
| l | 2442 | 1.5% |
| Other values (47) | 16835 |
DisabilityType
Categorical
MISSING  UNIFORM 
| Distinct | 3 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 697006 |
| Missing (%) | > 99.9% |
| Memory size | 21.3 MiB |
| no | |
|---|---|
| Chronic Illness | |
| Ashtma |
Length
| Max length | 15 |
|---|---|
| Median length | 6 |
| Mean length | 7.6666667 |
| Min length | 2 |
Characters and Unicode
| Total characters | 851 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | no |
|---|---|
| 2nd row | Chronic Illness |
| 3rd row | Ashtma |
| 4th row | no |
| 5th row | Chronic Illness |
Common Values
| Value | Count | Frequency (%) |
| no | 37 | < 0.1% |
| Chronic Illness | 37 | < 0.1% |
| Ashtma | 37 | < 0.1% |
| (Missing) | 697006 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| no | 37 | |
| chronic | 37 | |
| illness | 37 | |
| ashtma | 37 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 111 | |
| s | 111 | |
| o | 74 | 8.7% |
| h | 74 | 8.7% |
| l | 74 | 8.7% |
| C | 37 | 4.3% |
| r | 37 | 4.3% |
| i | 37 | 4.3% |
| c | 37 | 4.3% |
| 37 | 4.3% | |
| Other values (6) | 222 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 703 | |
| Uppercase Letter | 111 | 13.0% |
| Space Separator | 37 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 111 | |
| s | 111 | |
| o | 74 | |
| h | 74 | |
| l | 74 | |
| r | 37 | 5.3% |
| i | 37 | 5.3% |
| c | 37 | 5.3% |
| e | 37 | 5.3% |
| t | 37 | 5.3% |
| Other values (2) | 74 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 37 | |
| I | 37 | |
| A | 37 |
Space Separator
| Value | Count | Frequency (%) |
| 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 814 | |
| Common | 37 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 111 | |
| s | 111 | |
| o | 74 | 9.1% |
| h | 74 | 9.1% |
| l | 74 | 9.1% |
| C | 37 | 4.5% |
| r | 37 | 4.5% |
| i | 37 | 4.5% |
| c | 37 | 4.5% |
| I | 37 | 4.5% |
| Other values (5) | 185 |
Common
| Value | Count | Frequency (%) |
| 37 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 851 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 111 | |
| s | 111 | |
| o | 74 | 8.7% |
| h | 74 | 8.7% |
| l | 74 | 8.7% |
| C | 37 | 4.3% |
| r | 37 | 4.3% |
| i | 37 | 4.3% |
| c | 37 | 4.3% |
| 37 | 4.3% | |
| Other values (6) | 222 |
HealthConditions
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 697117 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.3 MiB |
EmergencyContactNumber
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 2672 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 579161 |
| Missing (%) | 83.1% |
| Memory size | 25.2 MiB |
| 0 | 5439 |
|---|---|
| 0681145763 | 2368 |
| 0648747951 | 259 |
| 0726014177 | 259 |
| 0760251247 | 222 |
| Other values (2667) |
Length
| Max length | 20 |
|---|---|
| Median length | 10 |
| Mean length | 9.5890841 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1131090 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0635118027 |
|---|---|
| 2nd row | 0714248050 |
| 3rd row | 0625698598 |
| 4th row | 0769598598 |
| 5th row | 0738862330 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5439 | 0.8% |
| 0681145763 | 2368 | 0.3% |
| 0648747951 | 259 | < 0.1% |
| 0726014177 | 259 | < 0.1% |
| 0760251247 | 222 | < 0.1% |
| 0799619663 | 222 | < 0.1% |
| 0818364480 | 185 | < 0.1% |
| 0787015537 | 185 | < 0.1% |
| 0790268989 | 185 | < 0.1% |
| 0718389466 | 185 | < 0.1% |
| Other values (2662) | 108447 | 15.6% |
| (Missing) | 579161 |
Length
| Value | Count | Frequency (%) |
| 0 | 5439 | 4.6% |
| 0681145763 | 2368 | 2.0% |
| 0648747951 | 259 | 0.2% |
| 0726014177 | 259 | 0.2% |
| 0760251247 | 222 | 0.2% |
| 0799619663 | 222 | 0.2% |
| 0818364480 | 185 | 0.2% |
| 0787015537 | 185 | 0.2% |
| 0790268989 | 185 | 0.2% |
| 0718389466 | 185 | 0.2% |
| Other values (2687) | 109520 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 199023 | |
| 7 | 147149 | |
| 6 | 122507 | |
| 8 | 102601 | |
| 2 | 99641 | |
| 1 | 98457 | |
| 3 | 95127 | |
| 4 | 86987 | |
| 9 | 85100 | |
| 5 | 81178 | |
| Other values (43) | 13320 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1117770 | |
| Lowercase Letter | 10027 | 0.9% |
| Uppercase Letter | 1924 | 0.2% |
| Space Separator | 1295 | 0.1% |
| Other Punctuation | 37 | < 0.1% |
| Close Punctuation | 37 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1517 | |
| a | 1406 | |
| o | 999 | |
| l | 925 | |
| n | 666 | 6.6% |
| t | 666 | 6.6% |
| i | 629 | 6.3% |
| h | 518 | 5.2% |
| s | 407 | 4.1% |
| k | 296 | 3.0% |
| Other values (12) | 1998 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 518 | |
| M | 296 | |
| L | 185 | 9.6% |
| A | 148 | 7.7% |
| T | 111 | 5.8% |
| G | 74 | 3.8% |
| N | 74 | 3.8% |
| C | 74 | 3.8% |
| O | 74 | 3.8% |
| P | 74 | 3.8% |
| Other values (8) | 296 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 199023 | |
| 7 | 147149 | |
| 6 | 122507 | |
| 8 | 102601 | |
| 2 | 99641 | |
| 1 | 98457 | |
| 3 | 95127 | |
| 4 | 86987 | |
| 9 | 85100 | |
| 5 | 81178 |
Space Separator
| Value | Count | Frequency (%) |
| 1295 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 37 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1119139 | |
| Latin | 11951 | 1.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1517 | |
| a | 1406 | 11.8% |
| o | 999 | 8.4% |
| l | 925 | 7.7% |
| n | 666 | 5.6% |
| t | 666 | 5.6% |
| i | 629 | 5.3% |
| S | 518 | 4.3% |
| h | 518 | 4.3% |
| s | 407 | 3.4% |
| Other values (30) | 3700 |
Common
| Value | Count | Frequency (%) |
| 0 | 199023 | |
| 7 | 147149 | |
| 6 | 122507 | |
| 8 | 102601 | |
| 2 | 99641 | |
| 1 | 98457 | |
| 3 | 95127 | |
| 4 | 86987 | |
| 9 | 85100 | |
| 5 | 81178 | |
| Other values (3) | 1369 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1131090 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 199023 | |
| 7 | 147149 | |
| 6 | 122507 | |
| 8 | 102601 | |
| 2 | 99641 | |
| 1 | 98457 | |
| 3 | 95127 | |
| 4 | 86987 | |
| 9 | 85100 | |
| 5 | 81178 | |
| Other values (43) | 13320 | 1.2% |
EmergencyContactFullName
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 2906 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 578643 |
| Missing (%) | 83.0% |
| Memory size | 25.6 MiB |
| Thandiwe | 296 |
|---|---|
| Kgomotso | 259 |
| Lizzy | 259 |
| Sandile Mvelase | 222 |
| Bongekile Ximba | 222 |
| Other values (2901) |
Length
| Max length | 38 |
|---|---|
| Median length | 27 |
| Mean length | 13.54466 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1604690 |
|---|---|
| Distinct characters | 71 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hans Koopman |
|---|---|
| 2nd row | Valencia Van Wyk |
| 3rd row | Eugene Louw |
| 4th row | Leandre Koopman |
| 5th row | Filicia Dawid |
Common Values
| Value | Count | Frequency (%) |
| Thandiwe | 296 | < 0.1% |
| Kgomotso | 259 | < 0.1% |
| Lizzy | 259 | < 0.1% |
| Sandile Mvelase | 222 | < 0.1% |
| Bongekile Ximba | 222 | < 0.1% |
| Kelebogile | 222 | < 0.1% |
| Lerato | 185 | < 0.1% |
| Veronica | 185 | < 0.1% |
| Zandile | 185 | < 0.1% |
| Nompumelelo | 185 | < 0.1% |
| Other values (2896) | 116254 | 16.7% |
| (Missing) | 578643 |
Length
| Value | Count | Frequency (%) |
| ngubane | 2664 | 1.3% |
| dlamini | 1369 | 0.6% |
| mkhize | 1258 | 0.6% |
| ndlovu | 1184 | 0.6% |
| sithole | 1110 | 0.5% |
| dladla | 1036 | 0.5% |
| mbatha | 1036 | 0.5% |
| khumalo | 925 | 0.4% |
| zandile | 888 | 0.4% |
| zuma | 851 | 0.4% |
| Other values (3090) | 199615 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 164058 | 10.2% |
| e | 161283 | 10.1% |
| 138380 | 8.6% | |
| i | 123950 | 7.7% |
| o | 106856 | 6.7% |
| l | 97532 | 6.1% |
| n | 97162 | 6.1% |
| h | 65971 | 4.1% |
| s | 52651 | 3.3% |
| u | 47064 | 2.9% |
| Other values (61) | 549783 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1250748 | |
| Uppercase Letter | 212787 | 13.3% |
| Space Separator | 138380 | 8.6% |
| Decimal Number | 1628 | 0.1% |
| Dash Punctuation | 629 | < 0.1% |
| Other Punctuation | 444 | < 0.1% |
| Open Punctuation | 37 | < 0.1% |
| Close Punctuation | 37 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 164058 | |
| e | 161283 | |
| i | 123950 | |
| o | 106856 | 8.5% |
| l | 97532 | 7.8% |
| n | 97162 | 7.8% |
| h | 65971 | 5.3% |
| s | 52651 | 4.2% |
| u | 47064 | 3.8% |
| t | 45843 | 3.7% |
| Other values (16) | 288378 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 43586 | |
| N | 27639 | |
| S | 21608 | |
| T | 13209 | 6.2% |
| K | 12876 | 6.1% |
| L | 10323 | 4.9% |
| B | 10175 | 4.8% |
| D | 9731 | 4.6% |
| Z | 7400 | 3.5% |
| P | 7326 | 3.4% |
| Other values (16) | 48914 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 370 | |
| 3 | 296 | |
| 1 | 222 | |
| 5 | 185 | |
| 2 | 148 | 9.1% |
| 9 | 111 | 6.8% |
| 4 | 74 | 4.5% |
| 7 | 74 | 4.5% |
| 6 | 74 | 4.5% |
| 8 | 74 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 259 | |
| ; | 74 | 16.7% |
| ' | 37 | 8.3% |
| , | 37 | 8.3% |
| / | 37 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| 138380 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 629 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 37 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1463535 | |
| Common | 141155 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 164058 | 11.2% |
| e | 161283 | 11.0% |
| i | 123950 | 8.5% |
| o | 106856 | 7.3% |
| l | 97532 | 6.7% |
| n | 97162 | 6.6% |
| h | 65971 | 4.5% |
| s | 52651 | 3.6% |
| u | 47064 | 3.2% |
| t | 45843 | 3.1% |
| Other values (42) | 501165 |
Common
| Value | Count | Frequency (%) |
| 138380 | ||
| - | 629 | 0.4% |
| 0 | 370 | 0.3% |
| 3 | 296 | 0.2% |
| . | 259 | 0.2% |
| 1 | 222 | 0.2% |
| 5 | 185 | 0.1% |
| 2 | 148 | 0.1% |
| 9 | 111 | 0.1% |
| ; | 74 | 0.1% |
| Other values (9) | 481 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1604690 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 164058 | 10.2% |
| e | 161283 | 10.1% |
| 138380 | 8.6% | |
| i | 123950 | 7.7% |
| o | 106856 | 6.7% |
| l | 97532 | 6.1% |
| n | 97162 | 6.1% |
| h | 65971 | 4.1% |
| s | 52651 | 3.3% |
| u | 47064 | 2.9% |
| Other values (61) | 549783 |
EmergencyContactFirstName
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 697117 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.3 MiB |
EmergencyContactSurname
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 697117 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.3 MiB |
AlternativePickupFirstName
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 697117 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.3 MiB |
AlternativePickupSurname
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 697117 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.3 MiB |
AlternativePickupContactNumber
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 626 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 658970 |
| Missing (%) | 94.5% |
| Memory size | 22.4 MiB |
| 0 | |
|---|---|
| 0818325688 | 222 |
| 0790268989 | 148 |
| None | 148 |
| 0825856457 | 148 |
| Other values (621) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 7.0795344 |
| Min length | 1 |
Characters and Unicode
| Total characters | 270063 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0764810096 |
| 3rd row | 0710863033 |
| 4th row | 0714778174 |
| 5th row | 0710863033 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 12136 | 1.7% |
| 0818325688 | 222 | < 0.1% |
| 0790268989 | 148 | < 0.1% |
| None | 148 | < 0.1% |
| 0825856457 | 148 | < 0.1% |
| 0799619663 | 148 | < 0.1% |
| 0797419017 | 148 | < 0.1% |
| 0818876048 | 148 | < 0.1% |
| 0715804800 | 111 | < 0.1% |
| 0720478111 | 111 | < 0.1% |
| Other values (616) | 24679 | 3.5% |
| (Missing) | 658970 |
Length
| Value | Count | Frequency (%) |
| 0 | 12173 | |
| 0818325688 | 222 | 0.6% |
| 0790268989 | 148 | 0.4% |
| none | 148 | 0.4% |
| 0825856457 | 148 | 0.4% |
| 0799619663 | 148 | 0.4% |
| 0797419017 | 148 | 0.4% |
| 0818876048 | 148 | 0.4% |
| 0793898860 | 111 | 0.3% |
| 0761751465 | 111 | 0.3% |
| Other values (615) | 24642 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 55611 | |
| 7 | 37111 | |
| 6 | 26529 | |
| 2 | 25641 | |
| 8 | 22792 | |
| 1 | 22237 | 8.2% |
| 9 | 21201 | 7.9% |
| 4 | 20313 | 7.5% |
| 3 | 19795 | 7.3% |
| 5 | 17390 | 6.4% |
| Other values (16) | 1443 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 268620 | |
| Lowercase Letter | 1110 | 0.4% |
| Uppercase Letter | 259 | 0.1% |
| Space Separator | 37 | < 0.1% |
| Other Punctuation | 37 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 185 | |
| e | 185 | |
| o | 148 | |
| a | 111 | |
| i | 74 | 6.7% |
| l | 74 | 6.7% |
| h | 74 | 6.7% |
| s | 74 | 6.7% |
| t | 74 | 6.7% |
| g | 37 | 3.3% |
| Other values (2) | 74 | 6.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 55611 | |
| 7 | 37111 | |
| 6 | 26529 | |
| 2 | 25641 | |
| 8 | 22792 | |
| 1 | 22237 | 8.3% |
| 9 | 21201 | 7.9% |
| 4 | 20313 | 7.6% |
| 3 | 19795 | 7.4% |
| 5 | 17390 | 6.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 185 | |
| M | 74 | 28.6% |
Space Separator
| Value | Count | Frequency (%) |
| 37 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 268694 | |
| Latin | 1369 | 0.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 185 | |
| n | 185 | |
| e | 185 | |
| o | 148 | |
| a | 111 | |
| i | 74 | 5.4% |
| l | 74 | 5.4% |
| h | 74 | 5.4% |
| s | 74 | 5.4% |
| t | 74 | 5.4% |
| Other values (4) | 185 |
Common
| Value | Count | Frequency (%) |
| 0 | 55611 | |
| 7 | 37111 | |
| 6 | 26529 | |
| 2 | 25641 | |
| 8 | 22792 | |
| 1 | 22237 | 8.3% |
| 9 | 21201 | 7.9% |
| 4 | 20313 | 7.6% |
| 3 | 19795 | 7.4% |
| 5 | 17390 | 6.5% |
| Other values (2) | 74 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 270063 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 55611 | |
| 7 | 37111 | |
| 6 | 26529 | |
| 2 | 25641 | |
| 8 | 22792 | |
| 1 | 22237 | 8.2% |
| 9 | 21201 | 7.9% |
| 4 | 20313 | 7.5% |
| 3 | 19795 | 7.3% |
| 5 | 17390 | 6.4% |
| Other values (16) | 1443 | 0.5% |
BirthDate
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 2019 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 62345 |
| Missing (%) | 8.9% |
| Memory size | 48.5 MiB |
| 2018-09-20T22:00:00Z | 1369 |
|---|---|
| 2018-09-27T22:00:00Z | 1221 |
| 2018-02-06T22:00:00Z | 1184 |
| 2018-08-12T22:00:00Z | 1184 |
| 2018-04-18T22:00:00Z | 1147 |
| Other values (2014) |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Characters and Unicode
| Total characters | 12695440 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2017-02-16T22:00:00Z |
|---|---|
| 2nd row | 2018-07-08T22:00:00Z |
| 3rd row | 2016-04-10T22:00:00Z |
| 4th row | 2015-06-10T22:00:00Z |
| 5th row | 2018-10-07T22:00:00Z |
Common Values
| Value | Count | Frequency (%) |
| 2018-09-20T22:00:00Z | 1369 | 0.2% |
| 2018-09-27T22:00:00Z | 1221 | 0.2% |
| 2018-02-06T22:00:00Z | 1184 | 0.2% |
| 2018-08-12T22:00:00Z | 1184 | 0.2% |
| 2018-04-18T22:00:00Z | 1147 | 0.2% |
| 2018-04-04T22:00:00Z | 1110 | 0.2% |
| 2018-08-15T22:00:00Z | 1110 | 0.2% |
| 2018-09-04T22:00:00Z | 1110 | 0.2% |
| 2018-10-09T22:00:00Z | 1110 | 0.2% |
| 2018-09-19T22:00:00Z | 1073 | 0.2% |
| Other values (2009) | 623154 | |
| (Missing) | 62345 | 8.9% |
Length
| Value | Count | Frequency (%) |
| 2018-09-20t22:00:00z | 1369 | 0.2% |
| 2018-09-27t22:00:00z | 1221 | 0.2% |
| 2018-02-06t22:00:00z | 1184 | 0.2% |
| 2018-08-12t22:00:00z | 1184 | 0.2% |
| 2018-04-18t22:00:00z | 1147 | 0.2% |
| 2018-04-04t22:00:00z | 1110 | 0.2% |
| 2018-08-15t22:00:00z | 1110 | 0.2% |
| 2018-09-04t22:00:00z | 1110 | 0.2% |
| 2018-10-09t22:00:00z | 1110 | 0.2% |
| 2018-09-19t22:00:00z | 1073 | 0.2% |
| Other values (2009) | 623154 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3988156 | |
| 2 | 2304730 | |
| - | 1269544 | 10.0% |
| : | 1269544 | 10.0% |
| 1 | 1143411 | 9.0% |
| T | 634772 | 5.0% |
| Z | 634772 | 5.0% |
| 8 | 371369 | 2.9% |
| 7 | 290339 | 2.3% |
| 9 | 232175 | 1.8% |
| Other values (4) | 556628 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8886808 | |
| Dash Punctuation | 1269544 | 10.0% |
| Other Punctuation | 1269544 | 10.0% |
| Uppercase Letter | 1269544 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3988156 | |
| 2 | 2304730 | |
| 1 | 1143411 | 12.9% |
| 8 | 371369 | 4.2% |
| 7 | 290339 | 3.3% |
| 9 | 232175 | 2.6% |
| 6 | 173382 | 2.0% |
| 3 | 147001 | 1.7% |
| 5 | 121841 | 1.4% |
| 4 | 114404 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 634772 | |
| Z | 634772 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1269544 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1269544 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11425896 | |
| Latin | 1269544 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3988156 | |
| 2 | 2304730 | |
| - | 1269544 | 11.1% |
| : | 1269544 | 11.1% |
| 1 | 1143411 | 10.0% |
| 8 | 371369 | 3.3% |
| 7 | 290339 | 2.5% |
| 9 | 232175 | 2.0% |
| 6 | 173382 | 1.5% |
| 3 | 147001 | 1.3% |
| Other values (2) | 236245 | 2.1% |
Latin
| Value | Count | Frequency (%) |
| T | 634772 | |
| Z | 634772 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12695440 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3988156 | |
| 2 | 2304730 | |
| - | 1269544 | 10.0% |
| : | 1269544 | 10.0% |
| 1 | 1143411 | 9.0% |
| T | 634772 | 5.0% |
| Z | 634772 | 5.0% |
| 8 | 371369 | 2.9% |
| 7 | 290339 | 2.3% |
| 9 | 232175 | 1.8% |
| Other values (4) | 556628 | 4.4% |
StartDate
Categorical
| Distinct | 811 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.5 MiB |
| 2022-01-19T00:00:00 | 17316 |
|---|---|
| 2021-02-15T00:00:00 | 12358 |
| 2020-01-15T00:00:00 | 12025 |
| 2021-04-06T00:00:00 | 10730 |
| 2022-01-10T00:00:00 | 8806 |
| Other values (806) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 13245223 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2020-01-17T00:00:00 |
|---|---|
| 2nd row | 2020-01-01T00:00:00 |
| 3rd row | 2020-01-17T00:00:00 |
| 4th row | 2019-10-03T00:00:00 |
| 5th row | 2019-10-22T00:00:00 |
Common Values
| Value | Count | Frequency (%) |
| 2022-01-19T00:00:00 | 17316 | 2.5% |
| 2021-02-15T00:00:00 | 12358 | 1.8% |
| 2020-01-15T00:00:00 | 12025 | 1.7% |
| 2021-04-06T00:00:00 | 10730 | 1.5% |
| 2022-01-10T00:00:00 | 8806 | 1.3% |
| 2021-04-07T00:00:00 | 8806 | 1.3% |
| 2021-03-01T00:00:00 | 8547 | 1.2% |
| 2021-01-11T00:00:00 | 6771 | 1.0% |
| 2022-01-24T00:00:00 | 6734 | 1.0% |
| 2022-02-01T00:00:00 | 6623 | 1.0% |
| Other values (801) | 598401 |
Length
| Value | Count | Frequency (%) |
| 2022-01-19t00:00:00 | 17316 | 2.5% |
| 2021-02-15t00:00:00 | 12358 | 1.8% |
| 2020-01-15t00:00:00 | 12025 | 1.7% |
| 2021-04-06t00:00:00 | 10730 | 1.5% |
| 2022-01-10t00:00:00 | 8806 | 1.3% |
| 2021-04-07t00:00:00 | 8806 | 1.3% |
| 2021-03-01t00:00:00 | 8547 | 1.2% |
| 2021-01-11t00:00:00 | 6771 | 1.0% |
| 2022-01-24t00:00:00 | 6734 | 1.0% |
| 2022-02-01t00:00:00 | 6623 | 1.0% |
| Other values (801) | 598401 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5937797 | |
| 2 | 1960001 | 14.8% |
| - | 1394234 | 10.5% |
| : | 1394234 | 10.5% |
| 1 | 1053427 | 8.0% |
| T | 697117 | 5.3% |
| 3 | 192511 | 1.5% |
| 5 | 136271 | 1.0% |
| 4 | 135309 | 1.0% |
| 6 | 101047 | 0.8% |
| Other values (3) | 243275 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9759638 | |
| Dash Punctuation | 1394234 | 10.5% |
| Other Punctuation | 1394234 | 10.5% |
| Uppercase Letter | 697117 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5937797 | |
| 2 | 1960001 | 20.1% |
| 1 | 1053427 | 10.8% |
| 3 | 192511 | 2.0% |
| 5 | 136271 | 1.4% |
| 4 | 135309 | 1.4% |
| 6 | 101047 | 1.0% |
| 9 | 85433 | 0.9% |
| 7 | 79809 | 0.8% |
| 8 | 78033 | 0.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1394234 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1394234 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 697117 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12548106 | |
| Latin | 697117 | 5.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5937797 | |
| 2 | 1960001 | 15.6% |
| - | 1394234 | 11.1% |
| : | 1394234 | 11.1% |
| 1 | 1053427 | 8.4% |
| 3 | 192511 | 1.5% |
| 5 | 136271 | 1.1% |
| 4 | 135309 | 1.1% |
| 6 | 101047 | 0.8% |
| 9 | 85433 | 0.7% |
| Other values (2) | 157842 | 1.3% |
Latin
| Value | Count | Frequency (%) |
| T | 697117 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13245223 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5937797 | |
| 2 | 1960001 | 14.8% |
| - | 1394234 | 10.5% |
| : | 1394234 | 10.5% |
| 1 | 1053427 | 8.0% |
| T | 697117 | 5.3% |
| 3 | 192511 | 1.5% |
| 5 | 136271 | 1.0% |
| 4 | 135309 | 1.0% |
| 6 | 101047 | 0.8% |
| Other values (3) | 243275 | 1.8% |
HasAllergy
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.9 KiB |
| False | |
|---|---|
| True | 2627 |
| Value | Count | Frequency (%) |
| False | 694490 | |
| True | 2627 | 0.4% |
HasDisability
Boolean
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 346468 |
| Missing (%) | 49.7% |
| Memory size | 21.3 MiB |
| False | |
|---|---|
| True | 999 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 349650 | |
| True | 999 | 0.1% |
| (Missing) | 346468 |
CaregiverPopiaConsent
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.9 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 565323 | |
| True | 131794 | 18.9% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.9 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 550338 | |
| True | 146779 | 21.1% |
IsSouthAfricanCitizen
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.9 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 378695 | |
| True | 318422 |
HasIdNumber
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.9 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 599622 | |
| False | 97495 | 14.0% |
Gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31968 |
| Missing (%) | 4.6% |
| Memory size | 40.3 MiB |
| Female | |
|---|---|
| Male |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.013406 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3334662 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Male |
| 3rd row | Female |
| 4th row | Male |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Female | 337033 | |
| Male | 328116 | |
| (Missing) | 31968 | 4.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 337033 | |
| male | 328116 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1002182 | |
| a | 665149 | |
| l | 665149 | |
| F | 337033 | 10.1% |
| m | 337033 | 10.1% |
| M | 328116 | 9.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2669513 | |
| Uppercase Letter | 665149 | 19.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1002182 | |
| a | 665149 | |
| l | 665149 | |
| m | 337033 | 12.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 337033 | |
| M | 328116 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3334662 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1002182 | |
| a | 665149 | |
| l | 665149 | |
| F | 337033 | 10.1% |
| m | 337033 | 10.1% |
| M | 328116 | 9.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3334662 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1002182 | |
| a | 665149 | |
| l | 665149 | |
| F | 337033 | 10.1% |
| m | 337033 | 10.1% |
| M | 328116 | 9.8% |
EthnicGroup
Categorical
IMBALANCE  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 211714 |
| Missing (%) | 30.4% |
| Memory size | 36.1 MiB |
| African | |
|---|---|
| Coloured | 33263 |
| White | 1369 |
| Other | 666 |
| Indian | 111 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.0599131 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3426903 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | African |
|---|---|
| 2nd row | African |
| 3rd row | African |
| 4th row | African |
| 5th row | Coloured |
Common Values
| Value | Count | Frequency (%) |
| African | 449994 | |
| Coloured | 33263 | 4.8% |
| White | 1369 | 0.2% |
| Other | 666 | 0.1% |
| Indian | 111 | < 0.1% |
| (Missing) | 211714 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| african | 449994 | |
| coloured | 33263 | 6.9% |
| white | 1369 | 0.3% |
| other | 666 | 0.1% |
| indian | 111 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 483923 | |
| i | 451474 | |
| n | 450216 | |
| a | 450105 | |
| A | 449994 | |
| c | 449994 | |
| f | 449994 | |
| o | 66526 | 1.9% |
| e | 35298 | 1.0% |
| d | 33374 | 1.0% |
| Other values (8) | 106005 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2941500 | |
| Uppercase Letter | 485403 | 14.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 483923 | |
| i | 451474 | |
| n | 450216 | |
| a | 450105 | |
| c | 449994 | |
| f | 449994 | |
| o | 66526 | 2.3% |
| e | 35298 | 1.2% |
| d | 33374 | 1.1% |
| l | 33263 | 1.1% |
| Other values (3) | 37333 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 449994 | |
| C | 33263 | 6.9% |
| W | 1369 | 0.3% |
| O | 666 | 0.1% |
| I | 111 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3426903 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 483923 | |
| i | 451474 | |
| n | 450216 | |
| a | 450105 | |
| A | 449994 | |
| c | 449994 | |
| f | 449994 | |
| o | 66526 | 1.9% |
| e | 35298 | 1.0% |
| d | 33374 | 1.0% |
| Other values (8) | 106005 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3426903 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 483923 | |
| i | 451474 | |
| n | 450216 | |
| a | 450105 | |
| A | 449994 | |
| c | 449994 | |
| f | 449994 | |
| o | 66526 | 1.9% |
| e | 35298 | 1.0% |
| d | 33374 | 1.0% |
| Other values (8) | 106005 | 3.1% |
HomeLanguage
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 226033 |
| Missing (%) | 32.4% |
| Memory size | 35.9 MiB |
| isiXhosa | |
|---|---|
| isiZulu | |
| Setswana | |
| Sepedi | |
| Afrikaans | |
| Other values (6) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.5932297 |
| Min length | 6 |
Characters and Unicode
| Total characters | 3577049 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | isiXhosa |
|---|---|
| 2nd row | Setswana |
| 3rd row | Afrikaans |
| 4th row | Afrikaans |
| 5th row | Afrikaans |
Common Values
| Value | Count | Frequency (%) |
| isiXhosa | 144041 | |
| isiZulu | 124320 | |
| Setswana | 55685 | 8.0% |
| Sepedi | 41144 | 5.9% |
| Afrikaans | 31598 | 4.5% |
| Sesotho | 28194 | 4.0% |
| Xitsonga | 14430 | 2.1% |
| English | 9213 | 1.3% |
| Tshivenda | 8917 | 1.3% |
| isiNdebele | 8473 | 1.2% |
| (Missing) | 226033 |
Length
| Value | Count | Frequency (%) |
| isixhosa | 144041 | |
| isizulu | 124320 | |
| setswana | 55685 | 11.8% |
| sepedi | 41144 | 8.7% |
| afrikaans | 31598 | 6.7% |
| sesotho | 28194 | 6.0% |
| xitsonga | 14430 | 3.1% |
| english | 9213 | 2.0% |
| tshivenda | 8917 | 1.9% |
| isindebele | 8473 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 669108 | |
| s | 573981 | |
| a | 347023 | |
| u | 248640 | 7.0% |
| o | 214859 | 6.0% |
| e | 200503 | 5.6% |
| h | 190365 | 5.3% |
| X | 158471 | 4.4% |
| l | 142006 | 4.0% |
| S | 130092 | 3.6% |
| Other values (16) | 702001 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3105965 | |
| Uppercase Letter | 471084 | 13.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 669108 | |
| s | 573981 | |
| a | 347023 | |
| u | 248640 | 8.0% |
| o | 214859 | 6.9% |
| e | 200503 | 6.5% |
| h | 190365 | 6.1% |
| l | 142006 | 4.6% |
| n | 119843 | 3.9% |
| t | 103378 | 3.3% |
| Other values (9) | 296259 |
Uppercase Letter
| Value | Count | Frequency (%) |
| X | 158471 | |
| S | 130092 | |
| Z | 124320 | |
| A | 31598 | 6.7% |
| E | 9213 | 2.0% |
| T | 8917 | 1.9% |
| N | 8473 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3577049 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 669108 | |
| s | 573981 | |
| a | 347023 | |
| u | 248640 | 7.0% |
| o | 214859 | 6.0% |
| e | 200503 | 5.6% |
| h | 190365 | 5.3% |
| X | 158471 | 4.4% |
| l | 142006 | 4.0% |
| S | 130092 | 3.6% |
| Other values (16) | 702001 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3577049 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 669108 | |
| s | 573981 | |
| a | 347023 | |
| u | 248640 | 7.0% |
| o | 214859 | 6.0% |
| e | 200503 | 5.6% |
| h | 190365 | 5.3% |
| X | 158471 | 4.4% |
| l | 142006 | 4.0% |
| S | 130092 | 3.6% |
| Other values (16) | 702001 |
GrantType
Categorical
IMBALANCE  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11655 |
| Missing (%) | 1.7% |
| Memory size | 44.3 MiB |
| Child Grant | |
|---|---|
| None | |
| Disability Grant | 1406 |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 10.199395 |
| Min length | 4 |
Characters and Unicode
| Total characters | 6991298 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Child Grant |
|---|---|
| 2nd row | Child Grant |
| 3rd row | Child Grant |
| 4th row | Child Grant |
| 5th row | Child Grant |
Common Values
| Value | Count | Frequency (%) |
| Child Grant | 604654 | |
| None | 79402 | 11.4% |
| Disability Grant | 1406 | 0.2% |
| (Missing) | 11655 | 1.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| grant | 606060 | |
| child | 604654 | |
| none | 79402 | 6.1% |
| disability | 1406 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 685462 | |
| i | 608872 | |
| t | 607466 | |
| a | 607466 | |
| l | 606060 | |
| 606060 | ||
| G | 606060 | |
| r | 606060 | |
| h | 604654 | |
| C | 604654 | |
| Other values (8) | 848484 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5093716 | |
| Uppercase Letter | 1291522 | 18.5% |
| Space Separator | 606060 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 685462 | |
| i | 608872 | |
| t | 607466 | |
| a | 607466 | |
| l | 606060 | |
| r | 606060 | |
| h | 604654 | |
| d | 604654 | |
| o | 79402 | 1.6% |
| e | 79402 | 1.6% |
| Other values (3) | 4218 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 606060 | |
| C | 604654 | |
| N | 79402 | 6.1% |
| D | 1406 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 606060 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6385238 | |
| Common | 606060 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 685462 | |
| i | 608872 | |
| t | 607466 | |
| a | 607466 | |
| l | 606060 | |
| G | 606060 | |
| r | 606060 | |
| h | 604654 | |
| C | 604654 | |
| d | 604654 | |
| Other values (7) | 243830 | 3.8% |
Common
| Value | Count | Frequency (%) |
| 606060 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6991298 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 685462 | |
| i | 608872 | |
| t | 607466 | |
| a | 607466 | |
| l | 606060 | |
| 606060 | ||
| G | 606060 | |
| r | 606060 | |
| h | 604654 | |
| C | 604654 | |
| Other values (8) | 848484 |
PlaygroupGroup
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.1 MiB |
| None | |
|---|---|
| Group A | |
| Group B | 47878 |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.7517117 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3312499 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Group A |
|---|---|
| 2nd row | None |
| 3rd row | None |
| 4th row | Group A |
| 5th row | Group B |
Common Values
| Value | Count | Frequency (%) |
| None | 522440 | |
| Group A | 126799 | 18.2% |
| Group B | 47878 | 6.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| none | 522440 | |
| group | 174677 | 20.0% |
| a | 126799 | 14.5% |
| b | 47878 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 697117 | |
| N | 522440 | |
| n | 522440 | |
| e | 522440 | |
| G | 174677 | 5.3% |
| r | 174677 | 5.3% |
| u | 174677 | 5.3% |
| p | 174677 | 5.3% |
| 174677 | 5.3% | |
| A | 126799 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2266028 | |
| Uppercase Letter | 871794 | 26.3% |
| Space Separator | 174677 | 5.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 697117 | |
| n | 522440 | |
| e | 522440 | |
| r | 174677 | 7.7% |
| u | 174677 | 7.7% |
| p | 174677 | 7.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 522440 | |
| G | 174677 | 20.0% |
| A | 126799 | 14.5% |
| B | 47878 | 5.5% |
Space Separator
| Value | Count | Frequency (%) |
| 174677 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3137822 | |
| Common | 174677 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 697117 | |
| N | 522440 | |
| n | 522440 | |
| e | 522440 | |
| G | 174677 | 5.6% |
| r | 174677 | 5.6% |
| u | 174677 | 5.6% |
| p | 174677 | 5.6% |
| A | 126799 | 4.0% |
| B | 47878 | 1.5% |
Common
| Value | Count | Frequency (%) |
| 174677 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3312499 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 697117 | |
| N | 522440 | |
| n | 522440 | |
| e | 522440 | |
| G | 174677 | 5.3% |
| r | 174677 | 5.3% |
| u | 174677 | 5.3% |
| p | 174677 | 5.3% |
| 174677 | 5.3% | |
| A | 126799 | 3.8% |
InactiveReason
Categorical
IMBALANCE  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 647833 |
| Missing (%) | 92.9% |
| Memory size | 23.8 MiB |
| Franchisee left the programme | |
|---|---|
| My child is starting Grade R | 925 |
| We are moving to a different area | 740 |
| Other | 666 |
| My child is starting Grade 1 | 185 |
Length
| Max length | 33 |
|---|---|
| Median length | 29 |
| Mean length | 28.713213 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1415102 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Franchisee left the programme |
|---|---|
| 2nd row | Franchisee left the programme |
| 3rd row | Franchisee left the programme |
| 4th row | Franchisee left the programme |
| 5th row | Franchisee left the programme |
Common Values
| Value | Count | Frequency (%) |
| Franchisee left the programme | 46768 | 6.7% |
| My child is starting Grade R | 925 | 0.1% |
| We are moving to a different area | 740 | 0.1% |
| Other | 666 | 0.1% |
| My child is starting Grade 1 | 185 | < 0.1% |
| (Missing) | 647833 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| franchisee | 46768 | |
| left | 46768 | |
| the | 46768 | |
| programme | 46768 | |
| my | 1110 | 0.6% |
| child | 1110 | 0.6% |
| is | 1110 | 0.6% |
| starting | 1110 | 0.6% |
| grade | 1110 | 0.6% |
| r | 925 | 0.5% |
| Other values (9) | 6031 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 239316 | |
| 150294 | ||
| r | 145410 | |
| a | 98716 | 7.0% |
| t | 97902 | 6.9% |
| h | 95312 | 6.7% |
| m | 94276 | 6.7% |
| i | 51578 | 3.6% |
| n | 49358 | 3.5% |
| s | 48988 | 3.5% |
| Other values (16) | 343952 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1213304 | |
| Space Separator | 150294 | 10.6% |
| Uppercase Letter | 51319 | 3.6% |
| Decimal Number | 185 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 239316 | |
| r | 145410 | |
| a | 98716 | |
| t | 97902 | |
| h | 95312 | 7.9% |
| m | 94276 | 7.8% |
| i | 51578 | 4.3% |
| n | 49358 | 4.1% |
| s | 48988 | 4.0% |
| g | 48618 | 4.0% |
| Other values (8) | 243830 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 46768 | |
| M | 1110 | 2.2% |
| G | 1110 | 2.2% |
| R | 925 | 1.8% |
| W | 740 | 1.4% |
| O | 666 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 150294 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 185 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1264623 | |
| Common | 150479 | 10.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 239316 | |
| r | 145410 | |
| a | 98716 | 7.8% |
| t | 97902 | 7.7% |
| h | 95312 | 7.5% |
| m | 94276 | 7.5% |
| i | 51578 | 4.1% |
| n | 49358 | 3.9% |
| s | 48988 | 3.9% |
| g | 48618 | 3.8% |
| Other values (14) | 295149 |
Common
| Value | Count | Frequency (%) |
| 150294 | ||
| 1 | 185 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1415102 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 239316 | |
| 150294 | ||
| r | 145410 | |
| a | 98716 | 7.0% |
| t | 97902 | 6.9% |
| h | 95312 | 6.7% |
| m | 94276 | 6.7% |
| i | 51578 | 3.6% |
| n | 49358 | 3.5% |
| s | 48988 | 3.5% |
| Other values (16) | 343952 |
Status
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.9 MiB |
| Active |
|---|
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 4182702 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Active |
|---|---|
| 2nd row | Active |
| 3rd row | Active |
| 4th row | Active |
| 5th row | Active |
Common Values
| Value | Count | Frequency (%) |
| Active | 697117 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| active | 697117 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 697117 | |
| c | 697117 | |
| t | 697117 | |
| i | 697117 | |
| v | 697117 | |
| e | 697117 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3485585 | |
| Uppercase Letter | 697117 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 697117 | |
| t | 697117 | |
| i | 697117 | |
| v | 697117 | |
| e | 697117 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 697117 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4182702 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 697117 | |
| c | 697117 | |
| t | 697117 | |
| i | 697117 | |
| v | 697117 | |
| e | 697117 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4182702 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 697117 | |
| c | 697117 | |
| t | 697117 | |
| i | 697117 | |
| v | 697117 | |
| e | 697117 |
Franchisee.Guid
Categorical
| Distinct | 3639 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.8 MiB |
| 9f6bbb81-c544-e911-828d-0800274bb0e4 | 2220 |
|---|---|
| ef6e021c-2a79-ea11-833b-00155d326100 | 1850 |
| e292d672-aa56-e811-817a-0800274bb0e4 | 1406 |
| ade15bf6-4f4f-e711-80e2-005056815442 | 1369 |
| 89f720fc-edcc-eb11-8349-00155d326100 | 1221 |
| Other values (3634) |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Characters and Unicode
| Total characters | 25096212 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1e84c406-3deb-e911-8325-0800274bb0e4 |
|---|---|
| 2nd row | 68814817-0705-ea11-8329-0800274bb0e4 |
| 3rd row | 1e84c406-3deb-e911-8325-0800274bb0e4 |
| 4th row | 1e84c406-3deb-e911-8325-0800274bb0e4 |
| 5th row | 1e84c406-3deb-e911-8325-0800274bb0e4 |
Common Values
| Value | Count | Frequency (%) |
| 9f6bbb81-c544-e911-828d-0800274bb0e4 | 2220 | 0.3% |
| ef6e021c-2a79-ea11-833b-00155d326100 | 1850 | 0.3% |
| e292d672-aa56-e811-817a-0800274bb0e4 | 1406 | 0.2% |
| ade15bf6-4f4f-e711-80e2-005056815442 | 1369 | 0.2% |
| 89f720fc-edcc-eb11-8349-00155d326100 | 1221 | 0.2% |
| 006e23ea-36e3-e811-819a-0800274bb0e4 | 1221 | 0.2% |
| b87e51e8-d7d1-e811-8187-0800274bb0e4 | 1184 | 0.2% |
| 90b93781-4704-ea11-8329-0800274bb0e4 | 1147 | 0.2% |
| 708b55e3-1923-eb11-8345-00155d326100 | 1147 | 0.2% |
| 03aabb7a-b565-ea11-833b-00155d326100 | 1147 | 0.2% |
| Other values (3629) | 683205 |
Length
| Value | Count | Frequency (%) |
| 9f6bbb81-c544-e911-828d-0800274bb0e4 | 2220 | 0.3% |
| ef6e021c-2a79-ea11-833b-00155d326100 | 1850 | 0.3% |
| e292d672-aa56-e811-817a-0800274bb0e4 | 1406 | 0.2% |
| ade15bf6-4f4f-e711-80e2-005056815442 | 1369 | 0.2% |
| 89f720fc-edcc-eb11-8349-00155d326100 | 1221 | 0.2% |
| 006e23ea-36e3-e811-819a-0800274bb0e4 | 1221 | 0.2% |
| b87e51e8-d7d1-e811-8187-0800274bb0e4 | 1184 | 0.2% |
| 90b93781-4704-ea11-8329-0800274bb0e4 | 1147 | 0.2% |
| 708b55e3-1923-eb11-8345-00155d326100 | 1147 | 0.2% |
| 03aabb7a-b565-ea11-833b-00155d326100 | 1147 | 0.2% |
| Other values (3629) | 683205 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3343320 | |
| 1 | 2824802 | |
| - | 2788468 | |
| 8 | 1801789 | 7.2% |
| 5 | 1552446 | 6.2% |
| 4 | 1540458 | 6.1% |
| e | 1532725 | 6.1% |
| 2 | 1359935 | 5.4% |
| 3 | 1337402 | 5.3% |
| b | 1275908 | 5.1% |
| Other values (7) | 5738959 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16562865 | |
| Lowercase Letter | 5744879 | 22.9% |
| Dash Punctuation | 2788468 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3343320 | |
| 1 | 2824802 | |
| 8 | 1801789 | |
| 5 | 1552446 | |
| 4 | 1540458 | |
| 2 | 1359935 | |
| 3 | 1337402 | |
| 6 | 1065452 | 6.4% |
| 7 | 958115 | 5.8% |
| 9 | 779146 | 4.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1532725 | |
| b | 1275908 | |
| d | 994819 | |
| c | 726865 | |
| a | 671143 | |
| f | 543419 | 9.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2788468 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19351333 | |
| Latin | 5744879 | 22.9% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3343320 | |
| 1 | 2824802 | |
| - | 2788468 | |
| 8 | 1801789 | |
| 5 | 1552446 | |
| 4 | 1540458 | |
| 2 | 1359935 | |
| 3 | 1337402 | |
| 6 | 1065452 | 5.5% |
| 7 | 958115 | 5.0% |
Latin
| Value | Count | Frequency (%) |
| e | 1532725 | |
| b | 1275908 | |
| d | 994819 | |
| c | 726865 | |
| a | 671143 | |
| f | 543419 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25096212 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3343320 | |
| 1 | 2824802 | |
| - | 2788468 | |
| 8 | 1801789 | 7.2% |
| 5 | 1552446 | 6.2% |
| 4 | 1540458 | 6.1% |
| e | 1532725 | 6.1% |
| 2 | 1359935 | 5.4% |
| 3 | 1337402 | 5.3% |
| b | 1275908 | 5.1% |
| Other values (7) | 5738959 |
Caregiver.FullName
Categorical
| Distinct | 17723 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 MiB |
| 12 12 | 555 |
|---|---|
| Jabulile Gamede | 444 |
| Mukelisiwe Cele | 370 |
| 1 1 | 370 |
| Faith Mofulwane | 333 |
| Other values (17718) |
Length
| Max length | 55 |
|---|---|
| Median length | 41 |
| Mean length | 17.792049 |
| Min length | 4 |
Characters and Unicode
| Total characters | 12403140 |
|---|---|
| Distinct characters | 79 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Shieda Komani |
|---|---|
| 2nd row | Mpho Ramohlabi |
| 3rd row | Ragel De Bruin |
| 4th row | Leatitia Zona |
| 5th row | Esmeralda Klaaste |
Common Values
| Value | Count | Frequency (%) |
| 12 12 | 555 | 0.1% |
| Jabulile Gamede | 444 | 0.1% |
| Mukelisiwe Cele | 370 | 0.1% |
| 1 1 | 370 | 0.1% |
| Faith Mofulwane | 333 | < 0.1% |
| Mavis Macala | 296 | < 0.1% |
| Mandisa Kheswa | 259 | < 0.1% |
| Vinolia Phiri | 185 | < 0.1% |
| Griet Olifant | 185 | < 0.1% |
| n/a n/a | 148 | < 0.1% |
| Other values (17713) | 693972 |
Length
| Value | Count | Frequency (%) |
| dlamini | 7326 | 0.5% |
| maria | 6179 | 0.4% |
| ndlovu | 5698 | 0.4% |
| sithole | 5661 | 0.4% |
| ngubane | 4736 | 0.3% |
| mkhize | 4477 | 0.3% |
| lerato | 4329 | 0.3% |
| zanele | 4218 | 0.3% |
| khumalo | 4144 | 0.3% |
| mahlangu | 3959 | 0.3% |
| Other values (15180) | 1476448 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1356013 | 10.9% |
| e | 1030117 | 8.3% |
| 1020053 | 8.2% | |
| i | 867206 | 7.0% |
| o | 789469 | 6.4% |
| n | 698338 | 5.6% |
| 697117 | 5.6% | |
| l | 657786 | 5.3% |
| h | 470825 | 3.8% |
| s | 382765 | 3.1% |
| Other values (69) | 4433451 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9088421 | |
| Space Separator | 1717170 | 13.8% |
| Uppercase Letter | 1562399 | 12.6% |
| Decimal Number | 29563 | 0.2% |
| Dash Punctuation | 3441 | < 0.1% |
| Other Punctuation | 1998 | < 0.1% |
| Open Punctuation | 74 | < 0.1% |
| Close Punctuation | 74 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1356013 | |
| e | 1030117 | |
| i | 867206 | |
| o | 789469 | 8.7% |
| n | 698338 | 7.7% |
| l | 657786 | 7.2% |
| h | 470825 | 5.2% |
| s | 382765 | 4.2% |
| u | 356606 | 3.9% |
| t | 355163 | 3.9% |
| Other values (21) | 2124133 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 355977 | |
| N | 223110 | |
| S | 140045 | 9.0% |
| T | 87505 | 5.6% |
| K | 73630 | 4.7% |
| L | 72187 | 4.6% |
| P | 67636 | 4.3% |
| B | 67340 | 4.3% |
| A | 61790 | 4.0% |
| D | 55278 | 3.5% |
| Other values (17) | 357901 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6549 | |
| 1 | 5180 | |
| 8 | 3959 | |
| 2 | 3737 | |
| 7 | 2220 | 7.5% |
| 9 | 2035 | 6.9% |
| 6 | 1591 | 5.4% |
| 5 | 1517 | 5.1% |
| 3 | 1406 | 4.8% |
| 4 | 1369 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 962 | |
| ' | 370 | 18.5% |
| / | 333 | 16.7% |
| , | 148 | 7.4% |
| ? | 148 | 7.4% |
| & | 37 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 1020053 | ||
| 697117 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3441 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 74 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 74 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10650820 | |
| Common | 1752320 | 14.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1356013 | 12.7% |
| e | 1030117 | 9.7% |
| i | 867206 | 8.1% |
| o | 789469 | 7.4% |
| n | 698338 | 6.6% |
| l | 657786 | 6.2% |
| h | 470825 | 4.4% |
| s | 382765 | 3.6% |
| u | 356606 | 3.3% |
| M | 355977 | 3.3% |
| Other values (48) | 3685718 |
Common
| Value | Count | Frequency (%) |
| 1020053 | ||
| 697117 | ||
| 0 | 6549 | 0.4% |
| 1 | 5180 | 0.3% |
| 8 | 3959 | 0.2% |
| 2 | 3737 | 0.2% |
| - | 3441 | 0.2% |
| 7 | 2220 | 0.1% |
| 9 | 2035 | 0.1% |
| 6 | 1591 | 0.1% |
| Other values (11) | 6438 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11705801 | |
| None | 697339 | 5.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1356013 | 11.6% |
| e | 1030117 | 8.8% |
| 1020053 | 8.7% | |
| i | 867206 | 7.4% |
| o | 789469 | 6.7% |
| n | 698338 | 6.0% |
| l | 657786 | 5.6% |
| h | 470825 | 4.0% |
| s | 382765 | 3.3% |
| u | 356606 | 3.0% |
| Other values (62) | 4076623 |
None
| Value | Count | Frequency (%) |
| 697117 | ||
| ź | 37 | < 0.1% |
| é | 37 | < 0.1% |
| ĺ | 37 | < 0.1% |
| ë | 37 | < 0.1% |
| Á | 37 | < 0.1% |
| ñ | 37 | < 0.1% |
Caregiver.FirstName
Categorical
| Distinct | 9230 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 185 |
| Missing (%) | < 0.1% |
| Memory size | 43.9 MiB |
| Maria | 2664 |
|---|---|
| Lerato | 2553 |
| Zanele | 2294 |
| Mpho | 2294 |
| Nthabiseng | 2294 |
| Other values (9225) |
Length
| Max length | 47 |
|---|---|
| Median length | 33 |
| Mean length | 9.0123699 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6281009 |
|---|---|
| Distinct characters | 75 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Shieda |
|---|---|
| 2nd row | Mpho |
| 3rd row | Ragel |
| 4th row | Leatitia |
| 5th row | Esmeralda |
Common Values
| Value | Count | Frequency (%) |
| Maria | 2664 | 0.4% |
| Lerato | 2553 | 0.4% |
| Zanele | 2294 | 0.3% |
| Mpho | 2294 | 0.3% |
| Nthabiseng | 2294 | 0.3% |
| Zandile | 2109 | 0.3% |
| Nokuthula | 1961 | 0.3% |
| Amanda | 1961 | 0.3% |
| Siphokazi | 1961 | 0.3% |
| Andiswa | 1813 | 0.3% |
| Other values (9220) | 675028 |
Length
| Value | Count | Frequency (%) |
| maria | 6142 | 0.7% |
| lerato | 4329 | 0.5% |
| zanele | 4218 | 0.5% |
| nthabiseng | 3848 | 0.5% |
| mpho | 3700 | 0.4% |
| thandeka | 3700 | 0.4% |
| zandile | 3515 | 0.4% |
| nokuthula | 3256 | 0.4% |
| portia | 3145 | 0.4% |
| nonhlanhla | 3071 | 0.4% |
| Other values (7005) | 784918 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 660265 | 10.5% |
| e | 619824 | 9.9% |
| i | 568320 | 9.0% |
| o | 447219 | 7.1% |
| n | 410885 | 6.5% |
| l | 387242 | 6.2% |
| h | 275428 | 4.4% |
| 264550 | 4.2% | |
| s | 230547 | 3.7% |
| t | 205905 | 3.3% |
| Other values (65) | 2210824 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5157134 | |
| Uppercase Letter | 839123 | 13.4% |
| Space Separator | 264550 | 4.2% |
| Decimal Number | 16021 | 0.3% |
| Dash Punctuation | 2664 | < 0.1% |
| Other Punctuation | 1443 | < 0.1% |
| Open Punctuation | 37 | < 0.1% |
| Close Punctuation | 37 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 660265 | |
| e | 619824 | |
| i | 568320 | |
| o | 447219 | 8.7% |
| n | 410885 | 8.0% |
| l | 387242 | 7.5% |
| h | 275428 | 5.3% |
| s | 230547 | 4.5% |
| t | 205905 | 4.0% |
| u | 182077 | 3.5% |
| Other values (19) | 1169422 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 139342 | |
| M | 96607 | |
| S | 75110 | 9.0% |
| T | 55907 | 6.7% |
| L | 51541 | 6.1% |
| A | 49580 | 5.9% |
| P | 49580 | 5.9% |
| B | 42328 | 5.0% |
| K | 36075 | 4.3% |
| Z | 30525 | 3.6% |
| Other values (17) | 212528 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3589 | |
| 1 | 2516 | |
| 8 | 2072 | |
| 2 | 2035 | |
| 7 | 1406 | 8.8% |
| 6 | 999 | 6.2% |
| 9 | 962 | 6.0% |
| 5 | 888 | 5.5% |
| 3 | 814 | 5.1% |
| 4 | 740 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 814 | |
| ' | 296 | 20.5% |
| , | 148 | 10.3% |
| ? | 148 | 10.3% |
| & | 37 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 264550 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2664 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 37 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5996257 | |
| Common | 284752 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 660265 | 11.0% |
| e | 619824 | 10.3% |
| i | 568320 | 9.5% |
| o | 447219 | 7.5% |
| n | 410885 | 6.9% |
| l | 387242 | 6.5% |
| h | 275428 | 4.6% |
| s | 230547 | 3.8% |
| t | 205905 | 3.4% |
| u | 182077 | 3.0% |
| Other values (46) | 2008545 |
Common
| Value | Count | Frequency (%) |
| 264550 | ||
| 0 | 3589 | 1.3% |
| - | 2664 | 0.9% |
| 1 | 2516 | 0.9% |
| 8 | 2072 | 0.7% |
| 2 | 2035 | 0.7% |
| 7 | 1406 | 0.5% |
| 6 | 999 | 0.4% |
| 9 | 962 | 0.3% |
| 5 | 888 | 0.3% |
| Other values (9) | 3071 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6280861 | |
| None | 148 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 660265 | 10.5% |
| e | 619824 | 9.9% |
| i | 568320 | 9.0% |
| o | 447219 | 7.1% |
| n | 410885 | 6.5% |
| l | 387242 | 6.2% |
| h | 275428 | 4.4% |
| 264550 | 4.2% | |
| s | 230547 | 3.7% |
| t | 205905 | 3.3% |
| Other values (61) | 2210676 |
None
| Value | Count | Frequency (%) |
| ĺ | 37 | |
| Á | 37 | |
| é | 37 | |
| ź | 37 |
Caregiver.Surname
Categorical
| Distinct | 9578 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 185 |
| Missing (%) | < 0.1% |
| Memory size | 42.4 MiB |
| Dlamini | 6401 |
|---|---|
| Ndlovu | 4329 |
| Sithole | 3700 |
| Mahlangu | 3367 |
| Khumalo | 3145 |
| Other values (9573) |
Length
| Max length | 23 |
|---|---|
| Median length | 20 |
| Mean length | 6.782491 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4726935 |
|---|---|
| Distinct characters | 71 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Komani |
|---|---|
| 2nd row | Ramohlabi |
| 3rd row | De Bruin |
| 4th row | Zona |
| 5th row | Klaaste |
Common Values
| Value | Count | Frequency (%) |
| Dlamini | 6401 | 0.9% |
| Ndlovu | 4329 | 0.6% |
| Sithole | 3700 | 0.5% |
| Mahlangu | 3367 | 0.5% |
| Khumalo | 3145 | 0.5% |
| Mkhize | 2849 | 0.4% |
| Ngubane | 2553 | 0.4% |
| Mokoena | 2368 | 0.3% |
| Zulu | 2294 | 0.3% |
| Mbatha | 2109 | 0.3% |
| Other values (9568) | 663817 |
Length
| Value | Count | Frequency (%) |
| dlamini | 6845 | 1.0% |
| sithole | 5439 | 0.8% |
| ndlovu | 5439 | 0.8% |
| ngubane | 4699 | 0.7% |
| mkhize | 4403 | 0.6% |
| khumalo | 3959 | 0.6% |
| mahlangu | 3922 | 0.6% |
| mbatha | 3145 | 0.4% |
| zulu | 2812 | 0.4% |
| dladla | 2775 | 0.4% |
| Other values (8956) | 659562 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 695452 | |
| e | 410293 | 8.7% |
| o | 342250 | 7.2% |
| i | 298886 | 6.3% |
| n | 287157 | 6.1% |
| l | 270544 | 5.7% |
| M | 259370 | 5.5% |
| h | 195397 | 4.1% |
| u | 174529 | 3.7% |
| s | 152218 | 3.2% |
| Other values (61) | 1640839 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3930695 | |
| Uppercase Letter | 723202 | 15.3% |
| Space Separator | 58386 | 1.2% |
| Decimal Number | 13542 | 0.3% |
| Dash Punctuation | 777 | < 0.1% |
| Other Punctuation | 259 | < 0.1% |
| Open Punctuation | 37 | < 0.1% |
| Close Punctuation | 37 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 695452 | |
| e | 410293 | |
| o | 342250 | 8.7% |
| i | 298886 | 7.6% |
| n | 287157 | 7.3% |
| l | 270544 | 6.9% |
| h | 195397 | 5.0% |
| u | 174529 | 4.4% |
| s | 152218 | 3.9% |
| t | 149258 | 3.8% |
| Other values (18) | 954711 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 259370 | |
| N | 83731 | 11.6% |
| S | 64935 | 9.0% |
| K | 37555 | 5.2% |
| T | 31598 | 4.4% |
| D | 31450 | 4.3% |
| B | 25012 | 3.5% |
| L | 20646 | 2.9% |
| G | 19721 | 2.7% |
| P | 18056 | 2.5% |
| Other values (16) | 131128 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2960 | |
| 1 | 2664 | |
| 8 | 1887 | |
| 2 | 1702 | |
| 9 | 1073 | 7.9% |
| 7 | 814 | 6.0% |
| 5 | 629 | 4.6% |
| 4 | 629 | 4.6% |
| 6 | 592 | 4.4% |
| 3 | 592 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 148 | |
| ' | 74 | |
| / | 37 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| 58386 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 777 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 37 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4653897 | |
| Common | 73038 | 1.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 695452 | |
| e | 410293 | 8.8% |
| o | 342250 | 7.4% |
| i | 298886 | 6.4% |
| n | 287157 | 6.2% |
| l | 270544 | 5.8% |
| M | 259370 | 5.6% |
| h | 195397 | 4.2% |
| u | 174529 | 3.8% |
| s | 152218 | 3.3% |
| Other values (44) | 1567801 |
Common
| Value | Count | Frequency (%) |
| 58386 | ||
| 0 | 2960 | 4.1% |
| 1 | 2664 | 3.6% |
| 8 | 1887 | 2.6% |
| 2 | 1702 | 2.3% |
| 9 | 1073 | 1.5% |
| 7 | 814 | 1.1% |
| - | 777 | 1.1% |
| 5 | 629 | 0.9% |
| 4 | 629 | 0.9% |
| Other values (7) | 1517 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4726861 | |
| None | 74 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 695452 | |
| e | 410293 | 8.7% |
| o | 342250 | 7.2% |
| i | 298886 | 6.3% |
| n | 287157 | 6.1% |
| l | 270544 | 5.7% |
| M | 259370 | 5.5% |
| h | 195397 | 4.1% |
| u | 174529 | 3.7% |
| s | 152218 | 3.2% |
| Other values (59) | 1640765 |
None
| Value | Count | Frequency (%) |
| ë | 37 | |
| ñ | 37 |
Caregiver.IdNumber
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 15284 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 90058 |
| Missing (%) | 12.9% |
| Memory size | 43.1 MiB |
| 0000000000000 | 1221 |
|---|---|
| 12 | 555 |
| 0 | 481 |
| 000000000000 | 481 |
| 6702250614083 | 444 |
| Other values (15279) |
Length
| Max length | 17 |
|---|---|
| Median length | 13 |
| Mean length | 12.784056 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7760676 |
|---|---|
| Distinct characters | 48 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 9511070255085 |
|---|---|
| 2nd row | 6004300574080 |
| 3rd row | 8612290232085 |
| 4th row | 9801020149086 |
| 5th row | 0010220038086 |
Common Values
| Value | Count | Frequency (%) |
| 0000000000000 | 1221 | 0.2% |
| 12 | 555 | 0.1% |
| 0 | 481 | 0.1% |
| 000000000000 | 481 | 0.1% |
| 6702250614083 | 444 | 0.1% |
| 0000000000 | 407 | 0.1% |
| 1 | 370 | 0.1% |
| 0000000000012 | 370 | 0.1% |
| 8207271099080 | 333 | < 0.1% |
| 8312221057087 | 259 | < 0.1% |
| Other values (15274) | 602138 | |
| (Missing) | 90058 | 12.9% |
Length
| Value | Count | Frequency (%) |
| 0000000000000 | 1221 | 0.2% |
| 12 | 555 | 0.1% |
| 0 | 481 | 0.1% |
| 000000000000 | 481 | 0.1% |
| 6702250614083 | 444 | 0.1% |
| 0000000000 | 407 | 0.1% |
| 1 | 370 | 0.1% |
| 0000000000012 | 370 | 0.1% |
| none | 333 | 0.1% |
| 8207271099080 | 333 | 0.1% |
| Other values (15297) | 603766 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2082878 | |
| 8 | 1183482 | |
| 1 | 979020 | |
| 9 | 675287 | 8.7% |
| 2 | 652236 | 8.4% |
| 7 | 451437 | 5.8% |
| 3 | 439486 | 5.7% |
| 5 | 437969 | 5.6% |
| 6 | 434306 | 5.6% |
| 4 | 409960 | 5.3% |
| Other values (38) | 14615 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7746061 | |
| Uppercase Letter | 8325 | 0.1% |
| Space Separator | 1850 | < 0.1% |
| Dash Punctuation | 1591 | < 0.1% |
| Lowercase Letter | 1554 | < 0.1% |
| Other Punctuation | 1184 | < 0.1% |
| Connector Punctuation | 74 | < 0.1% |
| Modifier Symbol | 37 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1998 | |
| A | 1147 | |
| C | 962 | |
| M | 703 | 8.4% |
| R | 518 | 6.2% |
| D | 481 | 5.8% |
| B | 407 | 4.9% |
| F | 370 | 4.4% |
| E | 333 | 4.0% |
| T | 296 | 3.6% |
| Other values (11) | 1110 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2082878 | |
| 8 | 1183482 | |
| 1 | 979020 | |
| 9 | 675287 | 8.7% |
| 2 | 652236 | 8.4% |
| 7 | 451437 | 5.8% |
| 3 | 439486 | 5.7% |
| 5 | 437969 | 5.7% |
| 6 | 434306 | 5.6% |
| 4 | 409960 | 5.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 481 | |
| e | 370 | |
| o | 370 | |
| a | 111 | 7.1% |
| m | 37 | 2.4% |
| w | 37 | 2.4% |
| i | 37 | 2.4% |
| s | 37 | 2.4% |
| k | 37 | 2.4% |
| r | 37 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1110 | |
| . | 37 | 3.1% |
| * | 37 | 3.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1850 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1591 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 74 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7750797 | |
| Latin | 9879 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1998 | |
| A | 1147 | |
| C | 962 | 9.7% |
| M | 703 | 7.1% |
| R | 518 | 5.2% |
| n | 481 | 4.9% |
| D | 481 | 4.9% |
| B | 407 | 4.1% |
| F | 370 | 3.7% |
| e | 370 | 3.7% |
| Other values (21) | 2442 |
Common
| Value | Count | Frequency (%) |
| 0 | 2082878 | |
| 8 | 1183482 | |
| 1 | 979020 | |
| 9 | 675287 | 8.7% |
| 2 | 652236 | 8.4% |
| 7 | 451437 | 5.8% |
| 3 | 439486 | 5.7% |
| 5 | 437969 | 5.7% |
| 6 | 434306 | 5.6% |
| 4 | 409960 | 5.3% |
| Other values (7) | 4736 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7760676 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2082878 | |
| 8 | 1183482 | |
| 1 | 979020 | |
| 9 | 675287 | 8.7% |
| 2 | 652236 | 8.4% |
| 7 | 451437 | 5.8% |
| 3 | 439486 | 5.7% |
| 5 | 437969 | 5.6% |
| 6 | 434306 | 5.6% |
| 4 | 409960 | 5.3% |
| Other values (38) | 14615 | 0.2% |
Caregiver.ContactNumber
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 11312 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 227587 |
| Missing (%) | 32.6% |
| Memory size | 36.9 MiB |
| 0 | 4329 |
|---|---|
| 0000000000 | 2701 |
| 0681145763 | 1850 |
| None | 1295 |
| 00000000 | 1147 |
| Other values (11307) |
Length
| Max length | 22 |
|---|---|
| Median length | 10 |
| Mean length | 9.8550039 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4627220 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0635118027 |
|---|---|
| 2nd row | 0780401410 |
| 3rd row | 0651332965 |
| 4th row | 0719602840 |
| 5th row | 0632598548 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 4329 | 0.6% |
| 0000000000 | 2701 | 0.4% |
| 0681145763 | 1850 | 0.3% |
| None | 1295 | 0.2% |
| 00000000 | 1147 | 0.2% |
| 000000000 | 962 | 0.1% |
| 0661469720 | 481 | 0.1% |
| 0761475377 | 444 | 0.1% |
| 0479392931 | 370 | 0.1% |
| + | 370 | 0.1% |
| Other values (11302) | 455581 | |
| (Missing) | 227587 |
Length
| Value | Count | Frequency (%) |
| 0 | 4329 | 0.9% |
| 0000000000 | 2701 | 0.6% |
| 0681145763 | 1850 | 0.4% |
| none | 1554 | 0.3% |
| 00000000 | 1147 | 0.2% |
| 000000000 | 962 | 0.2% |
| 0661469720 | 481 | 0.1% |
| 0761475377 | 444 | 0.1% |
| 0479392931 | 370 | 0.1% |
| 370 | 0.1% | |
| Other values (11318) | 456284 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 860583 | |
| 7 | 599067 | |
| 6 | 489362 | |
| 3 | 429163 | |
| 8 | 424945 | |
| 2 | 393939 | |
| 1 | 386724 | |
| 4 | 357642 | |
| 9 | 351648 | |
| 5 | 324823 | 7.0% |
| Other values (23) | 9324 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4617896 | |
| Lowercase Letter | 5624 | 0.1% |
| Uppercase Letter | 1813 | < 0.1% |
| Space Separator | 1184 | < 0.1% |
| Math Symbol | 370 | < 0.1% |
| Other Punctuation | 296 | < 0.1% |
| Dash Punctuation | 37 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1628 | |
| n | 1628 | |
| o | 1628 | |
| a | 185 | 3.3% |
| i | 111 | 2.0% |
| u | 111 | 2.0% |
| s | 74 | 1.3% |
| q | 74 | 1.3% |
| r | 37 | 0.7% |
| b | 37 | 0.7% |
| Other values (3) | 111 | 2.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 860583 | |
| 7 | 599067 | |
| 6 | 489362 | |
| 3 | 429163 | |
| 8 | 424945 | |
| 2 | 393939 | |
| 1 | 386724 | |
| 4 | 357642 | |
| 9 | 351648 | |
| 5 | 324823 | 7.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1591 | |
| O | 111 | 6.1% |
| H | 37 | 2.0% |
| B | 37 | 2.0% |
| Y | 37 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 259 | |
| \ | 37 | 12.5% |
Space Separator
| Value | Count | Frequency (%) |
| 1184 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 370 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4619783 | |
| Latin | 7437 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1628 | |
| n | 1628 | |
| o | 1628 | |
| N | 1591 | |
| a | 185 | 2.5% |
| O | 111 | 1.5% |
| i | 111 | 1.5% |
| u | 111 | 1.5% |
| s | 74 | 1.0% |
| q | 74 | 1.0% |
| Other values (8) | 296 | 4.0% |
Common
| Value | Count | Frequency (%) |
| 0 | 860583 | |
| 7 | 599067 | |
| 6 | 489362 | |
| 3 | 429163 | |
| 8 | 424945 | |
| 2 | 393939 | |
| 1 | 386724 | |
| 4 | 357642 | |
| 9 | 351648 | |
| 5 | 324823 | 7.0% |
| Other values (5) | 1887 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4627220 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 860583 | |
| 7 | 599067 | |
| 6 | 489362 | |
| 3 | 429163 | |
| 8 | 424945 | |
| 2 | 393939 | |
| 1 | 386724 | |
| 4 | 357642 | |
| 9 | 351648 | |
| 5 | 324823 | 7.0% |
| Other values (23) | 9324 | 0.2% |
Caregiver.RelationshipType
Categorical
IMBALANCE  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 278832 |
| Missing (%) | 40.0% |
| Memory size | 33.7 MiB |
| Mother | |
|---|---|
| Guardian | 26159 |
| Father | 11581 |
| Grandparent | 9509 |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 6.2387439 |
| Min length | 6 |
Characters and Unicode
| Total characters | 2609573 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mother |
|---|---|
| 2nd row | Mother |
| 3rd row | Mother |
| 4th row | Mother |
| 5th row | Mother |
Common Values
| Value | Count | Frequency (%) |
| Mother | 371036 | |
| Guardian | 26159 | 3.8% |
| Father | 11581 | 1.7% |
| Grandparent | 9509 | 1.4% |
| (Missing) | 278832 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| mother | 371036 | |
| guardian | 26159 | 6.3% |
| father | 11581 | 2.8% |
| grandparent | 9509 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 427794 | |
| t | 392126 | |
| e | 392126 | |
| h | 382617 | |
| M | 371036 | |
| o | 371036 | |
| a | 82917 | 3.2% |
| n | 45177 | 1.7% |
| G | 35668 | 1.4% |
| d | 35668 | 1.4% |
| Other values (4) | 73408 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2191288 | |
| Uppercase Letter | 418285 | 16.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 427794 | |
| t | 392126 | |
| e | 392126 | |
| h | 382617 | |
| o | 371036 | |
| a | 82917 | 3.8% |
| n | 45177 | 2.1% |
| d | 35668 | 1.6% |
| u | 26159 | 1.2% |
| i | 26159 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 371036 | |
| G | 35668 | 8.5% |
| F | 11581 | 2.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2609573 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 427794 | |
| t | 392126 | |
| e | 392126 | |
| h | 382617 | |
| M | 371036 | |
| o | 371036 | |
| a | 82917 | 3.2% |
| n | 45177 | 1.7% |
| G | 35668 | 1.4% |
| d | 35668 | 1.4% |
| Other values (4) | 73408 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2609573 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 427794 | |
| t | 392126 | |
| e | 392126 | |
| h | 382617 | |
| M | 371036 | |
| o | 371036 | |
| a | 82917 | 3.2% |
| n | 45177 | 1.7% |
| G | 35668 | 1.4% |
| d | 35668 | 1.4% |
| Other values (4) | 73408 | 2.8% |
Caregiver.HighestEducationLevel
Categorical
IMBALANCE  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 519591 |
| Missing (%) | 74.5% |
| Memory size | 27.0 MiB |
| No Matric | |
|---|---|
| Diploma | 2627 |
| NQF Level 4 ECD | 1850 |
| Higher Certificate | 1332 |
| Bachelors | 740 |
Length
| Max length | 18 |
|---|---|
| Median length | 9 |
| Mean length | 9.1004585 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1615568 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Matric |
|---|---|
| 2nd row | No Matric |
| 3rd row | No Matric |
| 4th row | No Matric |
| 5th row | No Matric |
Common Values
| Value | Count | Frequency (%) |
| No Matric | 170940 | 24.5% |
| Diploma | 2627 | 0.4% |
| NQF Level 4 ECD | 1850 | 0.3% |
| Higher Certificate | 1332 | 0.2% |
| Bachelors | 740 | 0.1% |
| Doctorate | 37 | < 0.1% |
| (Missing) | 519591 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| no | 170940 | |
| matric | 170940 | |
| diploma | 2627 | 0.7% |
| nqf | 1850 | 0.5% |
| level | 1850 | 0.5% |
| 4 | 1850 | 0.5% |
| ecd | 1850 | 0.5% |
| higher | 1332 | 0.4% |
| certificate | 1332 | 0.4% |
| bachelors | 740 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 177822 | ||
| i | 177563 | |
| a | 175676 | |
| r | 174381 | |
| o | 174381 | |
| t | 173678 | |
| c | 173049 | |
| N | 172790 | |
| M | 170940 | |
| e | 8473 | 0.5% |
| Other values (17) | 36815 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1074998 | |
| Uppercase Letter | 360898 | 22.3% |
| Space Separator | 177822 | 11.0% |
| Decimal Number | 1850 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 177563 | |
| a | 175676 | |
| r | 174381 | |
| o | 174381 | |
| t | 173678 | |
| c | 173049 | |
| e | 8473 | 0.8% |
| l | 5217 | 0.5% |
| m | 2627 | 0.2% |
| p | 2627 | 0.2% |
| Other values (5) | 7326 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 172790 | |
| M | 170940 | |
| D | 4514 | 1.3% |
| C | 3182 | 0.9% |
| E | 1850 | 0.5% |
| Q | 1850 | 0.5% |
| L | 1850 | 0.5% |
| F | 1850 | 0.5% |
| H | 1332 | 0.4% |
| B | 740 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 177822 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1850 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1435896 | |
| Common | 179672 | 11.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 177563 | |
| a | 175676 | |
| r | 174381 | |
| o | 174381 | |
| t | 173678 | |
| c | 173049 | |
| N | 172790 | |
| M | 170940 | |
| e | 8473 | 0.6% |
| l | 5217 | 0.4% |
| Other values (15) | 29748 | 2.1% |
Common
| Value | Count | Frequency (%) |
| 177822 | ||
| 4 | 1850 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1615568 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 177822 | ||
| i | 177563 | |
| a | 175676 | |
| r | 174381 | |
| o | 174381 | |
| t | 173678 | |
| c | 173049 | |
| N | 172790 | |
| M | 170940 | |
| e | 8473 | 0.5% |
| Other values (17) | 36815 | 2.3% |
Caregiver.Language
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 676212 |
| Missing (%) | 97.0% |
| Memory size | 21.9 MiB |
| isiZulu | |
|---|---|
| isiXhosa | |
| Setswana | |
| Afrikaans | |
| Sepedi | |
| Other values (6) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.5026549 |
| Min length | 6 |
Characters and Unicode
| Total characters | 156843 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | isiXhosa |
|---|---|
| 2nd row | Afrikaans |
| 3rd row | Afrikaans |
| 4th row | Afrikaans |
| 5th row | Afrikaans |
Common Values
| Value | Count | Frequency (%) |
| isiZulu | 7363 | 1.1% |
| isiXhosa | 3515 | 0.5% |
| Setswana | 3256 | 0.5% |
| Afrikaans | 2183 | 0.3% |
| Sepedi | 2035 | 0.3% |
| siSwati | 777 | 0.1% |
| Sesotho | 666 | 0.1% |
| Xitsonga | 481 | 0.1% |
| Tshivenda | 296 | < 0.1% |
| English | 222 | < 0.1% |
| (Missing) | 676212 |
Length
| Value | Count | Frequency (%) |
| isizulu | 7363 | |
| isixhosa | 3515 | |
| setswana | 3256 | |
| afrikaans | 2183 | 10.4% |
| sepedi | 2035 | 9.7% |
| siswati | 777 | 3.7% |
| sesotho | 666 | 3.2% |
| xitsonga | 481 | 2.3% |
| tshivenda | 296 | 1.4% |
| english | 222 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 28749 | |
| s | 22385 | |
| a | 15947 | |
| u | 14726 | |
| e | 8621 | 5.5% |
| l | 7696 | 4.9% |
| Z | 7363 | 4.7% |
| S | 6734 | 4.3% |
| n | 6438 | 4.1% |
| o | 5328 | 3.4% |
| Other values (16) | 32856 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 135938 | |
| Uppercase Letter | 20905 | 13.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 28749 | |
| s | 22385 | |
| a | 15947 | |
| u | 14726 | |
| e | 8621 | 6.3% |
| l | 7696 | 5.7% |
| n | 6438 | 4.7% |
| o | 5328 | 3.9% |
| t | 5180 | 3.8% |
| h | 4699 | 3.5% |
| Other values (9) | 16169 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Z | 7363 | |
| S | 6734 | |
| X | 3996 | |
| A | 2183 | 10.4% |
| T | 296 | 1.4% |
| E | 222 | 1.1% |
| N | 111 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 156843 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 28749 | |
| s | 22385 | |
| a | 15947 | |
| u | 14726 | |
| e | 8621 | 5.5% |
| l | 7696 | 4.9% |
| Z | 7363 | 4.7% |
| S | 6734 | 4.3% |
| n | 6438 | 4.1% |
| o | 5328 | 3.4% |
| Other values (16) | 32856 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 156843 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 28749 | |
| s | 22385 | |
| a | 15947 | |
| u | 14726 | |
| e | 8621 | 5.5% |
| l | 7696 | 4.9% |
| Z | 7363 | 4.7% |
| S | 6734 | 4.3% |
| n | 6438 | 4.1% |
| o | 5328 | 3.4% |
| Other values (16) | 32856 |
Caregiver.Guid
Categorical
| Distinct | 18103 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.8 MiB |
| 69516fc1-1271-eb11-8345-00155d326100 | 555 |
|---|---|
| 87ab4728-bc13-ec11-834c-00155d326100 | 444 |
| 339b9917-12ab-ea11-833e-00155d326100 | 370 |
| e2f323de-ca74-ea11-833b-00155d326100 | 370 |
| 2d1a67d8-0b95-ea11-833c-00155d326100 | 333 |
| Other values (18098) |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Characters and Unicode
| Total characters | 25096212 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3b69f593-3d43-ea11-8330-080027a7109a |
|---|---|
| 2nd row | 581e6f1a-bc45-ea11-833a-00155d326100 |
| 3rd row | f32af543-b745-ea11-833a-00155d326100 |
| 4th row | e99b7a2c-f945-ea11-833a-00155d326100 |
| 5th row | 1254a16f-4f46-ea11-833a-00155d326100 |
Common Values
| Value | Count | Frequency (%) |
| 69516fc1-1271-eb11-8345-00155d326100 | 555 | 0.1% |
| 87ab4728-bc13-ec11-834c-00155d326100 | 444 | 0.1% |
| 339b9917-12ab-ea11-833e-00155d326100 | 370 | 0.1% |
| e2f323de-ca74-ea11-833b-00155d326100 | 370 | 0.1% |
| 2d1a67d8-0b95-ea11-833c-00155d326100 | 333 | < 0.1% |
| adf213d5-5e83-ec11-8350-00155d326100 | 259 | < 0.1% |
| e82a1a5b-e8d4-eb11-8349-00155d326100 | 185 | < 0.1% |
| a97e0cef-ba9a-eb11-8346-00155d326100 | 148 | < 0.1% |
| 642bd778-4d7f-eb11-8346-00155d326100 | 148 | < 0.1% |
| 512b9642-8d09-ec11-834c-00155d326100 | 148 | < 0.1% |
| Other values (18093) | 694157 |
Length
| Value | Count | Frequency (%) |
| 69516fc1-1271-eb11-8345-00155d326100 | 555 | 0.1% |
| 87ab4728-bc13-ec11-834c-00155d326100 | 444 | 0.1% |
| 339b9917-12ab-ea11-833e-00155d326100 | 370 | 0.1% |
| e2f323de-ca74-ea11-833b-00155d326100 | 370 | 0.1% |
| 2d1a67d8-0b95-ea11-833c-00155d326100 | 333 | < 0.1% |
| adf213d5-5e83-ec11-8350-00155d326100 | 259 | < 0.1% |
| e82a1a5b-e8d4-eb11-8349-00155d326100 | 185 | < 0.1% |
| 3f6bb4ba-ff78-ec11-834d-00155d326100 | 148 | < 0.1% |
| 2a29e3e1-6567-ea11-833b-00155d326100 | 148 | < 0.1% |
| a5f94145-4e99-ec11-8351-00155d326100 | 148 | < 0.1% |
| Other values (18093) | 694157 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3389718 | |
| 1 | 3334070 | |
| - | 2788468 | |
| 5 | 2124096 | |
| 3 | 1947939 | 7.8% |
| 8 | 1330668 | 5.3% |
| 6 | 1300957 | 5.2% |
| d | 1280385 | 5.1% |
| e | 1206866 | 4.8% |
| 2 | 1204905 | 4.8% |
| Other values (7) | 5188140 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16921469 | |
| Lowercase Letter | 5386275 | 21.5% |
| Dash Punctuation | 2788468 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3389718 | |
| 1 | 3334070 | |
| 5 | 2124096 | |
| 3 | 1947939 | |
| 8 | 1330668 | 7.9% |
| 6 | 1300957 | 7.7% |
| 2 | 1204905 | 7.1% |
| 4 | 987456 | 5.8% |
| 9 | 711251 | 4.2% |
| 7 | 590409 | 3.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1280385 | |
| e | 1206866 | |
| b | 883523 | |
| c | 856772 | |
| a | 656824 | |
| f | 501905 | 9.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2788468 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19709937 | |
| Latin | 5386275 | 21.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3389718 | |
| 1 | 3334070 | |
| - | 2788468 | |
| 5 | 2124096 | |
| 3 | 1947939 | |
| 8 | 1330668 | 6.8% |
| 6 | 1300957 | 6.6% |
| 2 | 1204905 | 6.1% |
| 4 | 987456 | 5.0% |
| 9 | 711251 | 3.6% |
Latin
| Value | Count | Frequency (%) |
| d | 1280385 | |
| e | 1206866 | |
| b | 883523 | |
| c | 856772 | |
| a | 656824 | |
| f | 501905 | 9.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25096212 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3389718 | |
| 1 | 3334070 | |
| - | 2788468 | |
| 5 | 2124096 | |
| 3 | 1947939 | 7.8% |
| 8 | 1330668 | 5.3% |
| 6 | 1300957 | 5.2% |
| d | 1280385 | 5.1% |
| e | 1206866 | 4.8% |
| 2 | 1204905 | 4.8% |
| Other values (7) | 5188140 |
| Unnamed: 0 | Guid | FullName | FirstName | Surname | IdNumber | AllergyType | DisabilityType | HealthConditions | EmergencyContactNumber | EmergencyContactFullName | EmergencyContactFirstName | EmergencyContactSurname | AlternativePickupFirstName | AlternativePickupSurname | AlternativePickupContactNumber | BirthDate | StartDate | HasAllergy | HasDisability | CaregiverPopiaConsent | CaregiverPhotographyAndFilmingConsent | IsSouthAfricanCitizen | HasIdNumber | Gender | EthnicGroup | HomeLanguage | GrantType | PlaygroupGroup | InactiveReason | Status | Franchisee.Guid | Caregiver.FullName | Caregiver.FirstName | Caregiver.Surname | Caregiver.IdNumber | Caregiver.ContactNumber | Caregiver.RelationshipType | Caregiver.HighestEducationLevel | Caregiver.Language | Caregiver.Guid | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0605e301-a345-ea11-833a-00155d326100 | Mxolisi komani | Mxolisi | komani | 0000000000012 | NaN | NaN | NaN | 0635118027 | Hans Koopman | NaN | NaN | NaN | NaN | NaN | 2017-02-16T22:00:00Z | 2020-01-17T00:00:00 | False | NaN | False | False | True | False | Male | African | isiXhosa | Child Grant | Group A | Franchisee left the programme | Active | 1e84c406-3deb-e911-8325-0800274bb0e4 | Shieda Komani | Shieda | Komani | 9511070255085 | 0635118027 | Mother | No Matric | isiXhosa | 3b69f593-3d43-ea11-8330-080027a7109a |
| 1 | 1 | 5c1e6f1a-bc45-ea11-833a-00155d326100 | Thateho Ramohlabi | Thateho | Ramohlabi | 1807095666084 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2018-07-08T22:00:00Z | 2020-01-01T00:00:00 | False | NaN | False | False | False | True | Male | African | Setswana | Child Grant | None | Franchisee left the programme | Active | 68814817-0705-ea11-8329-0800274bb0e4 | Mpho Ramohlabi | Mpho | Ramohlabi | NaN | 0780401410 | Mother | No Matric | NaN | 581e6f1a-bc45-ea11-833a-00155d326100 |
| 2 | 2 | 5637445f-eb45-ea11-833a-00155d326100 | Shenaaze van wyk | Shenaaze | van wyk | 0000000000012 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2016-04-10T22:00:00Z | 2020-01-17T00:00:00 | False | NaN | False | False | False | False | Female | African | Afrikaans | NaN | None | NaN | Active | 1e84c406-3deb-e911-8325-0800274bb0e4 | Ragel De Bruin | Ragel | De Bruin | 6004300574080 | 0651332965 | Mother | No Matric | Afrikaans | f32af543-b745-ea11-833a-00155d326100 |
| 3 | 3 | 4da208b6-fa45-ea11-833a-00155d326100 | Leatitia Zona | Leatitia | Zona | 0000000000012 | NaN | NaN | NaN | 0714248050 | Valencia Van Wyk | NaN | NaN | NaN | NaN | NaN | 2015-06-10T22:00:00Z | 2019-10-03T00:00:00 | False | NaN | False | False | True | False | Male | African | Afrikaans | Child Grant | Group A | Franchisee left the programme | Active | 1e84c406-3deb-e911-8325-0800274bb0e4 | Leatitia Zona | Leatitia | Zona | 8612290232085 | 0719602840 | Mother | No Matric | NaN | e99b7a2c-f945-ea11-833a-00155d326100 |
| 4 | 4 | cdb4a38c-4f46-ea11-833a-00155d326100 | Avandro Pieter Klaaste | Avandro Pieter | Klaaste | 1806226123086 | NaN | NaN | NaN | 0625698598 | Eugene Louw | NaN | NaN | NaN | NaN | NaN | 2018-10-07T22:00:00Z | 2019-10-22T00:00:00 | False | NaN | True | True | False | True | Male | Coloured | Afrikaans | Child Grant | Group B | Franchisee left the programme | Active | 1e84c406-3deb-e911-8325-0800274bb0e4 | Esmeralda Klaaste | Esmeralda | Klaaste | 9801020149086 | 0632598548 | Mother | No Matric | NaN | 1254a16f-4f46-ea11-833a-00155d326100 |
| 5 | 5 | 2b427474-5046-ea11-833a-00155d326100 | Gillasha Koopman | Gillasha | Koopman | 1810270892081 | NaN | NaN | NaN | 0769598598 | Leandre Koopman | NaN | NaN | NaN | NaN | NaN | 2018-10-26T22:00:00Z | 2019-11-18T00:00:00 | False | NaN | True | True | True | True | Female | Coloured | Afrikaans | Child Grant | Group B | Franchisee left the programme | Active | 8c168cb8-65ee-e911-8325-0800274bb0e4 | Leandre Koopman | Leandre | Koopman | 0010220038086 | 0725969896 | Mother | No Matric | NaN | 52264458-5046-ea11-833a-00155d326100 |
| 6 | 6 | 52abcbd9-5046-ea11-833a-00155d326100 | Mpaballeng Happiness Maya | Mpaballeng Happiness | Maya | 1805060468080 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2018-05-05T22:00:00Z | 2020-01-06T00:00:00 | False | NaN | False | False | False | True | Female | African | NaN | Child Grant | None | NaN | Active | d5a7ba31-64ee-e911-8325-0800274bb0e4 | Mapaseka Maya | Mapaseka | Maya | 8803020524087 | 0735119766 | NaN | NaN | NaN | f68abdbd-5046-ea11-833a-00155d326100 |
| 7 | 7 | 6e17db16-5146-ea11-833a-00155d326100 | Prihano Davids | Prihano | Davids | 0000000000012 | NaN | NaN | NaN | 0738862330 | Filicia Dawid | NaN | NaN | NaN | NaN | NaN | 2017-03-17T22:00:00Z | 2019-11-13T00:00:00 | False | NaN | True | True | False | False | Male | Coloured | Afrikaans | Child Grant | Group B | Franchisee left the programme | Active | 8c168cb8-65ee-e911-8325-0800274bb0e4 | Filicia David | Filicia | David | 8309150139084 | 0738862330 | Mother | No Matric | NaN | d408dcf1-5046-ea11-833a-00155d326100 |
| 8 | 8 | 6aab0708-5246-ea11-833a-00155d326100 | Kim-lee Wolmarans | Kim-lee | Wolmarans | 1805280403085 | NaN | NaN | NaN | 07458996856 | Delixa | NaN | NaN | NaN | NaN | NaN | 2018-05-27T22:00:00Z | 2019-11-19T00:00:00 | False | NaN | True | True | True | True | Female | Coloured | Afrikaans | Child Grant | Group B | Franchisee left the programme | Active | 8c168cb8-65ee-e911-8325-0800274bb0e4 | Delixa Wolmarans | Delixa | Wolmarans | 8308250162087 | 0712569698 | Mother | No Matric | NaN | 92e45ce2-5146-ea11-833a-00155d326100 |
| 9 | 9 | 63745080-5246-ea11-833a-00155d326100 | Leonardo Jansen | Leonardo | Jansen | 1712036113083 | NaN | NaN | NaN | 081255889 | Juanetta | NaN | NaN | NaN | NaN | NaN | 2017-12-02T22:00:00Z | 2019-11-20T00:00:00 | False | NaN | True | True | True | True | Male | Coloured | Afrikaans | Child Grant | Group A | Franchisee left the programme | Active | 1e84c406-3deb-e911-8325-0800274bb0e4 | Juanetta Jansen | Juanetta | Jansen | 9305040247086 | 0725698856 | Mother | No Matric | NaN | 89ccd869-5246-ea11-833a-00155d326100 |
| Unnamed: 0 | Guid | FullName | FirstName | Surname | IdNumber | AllergyType | DisabilityType | HealthConditions | EmergencyContactNumber | EmergencyContactFullName | EmergencyContactFirstName | EmergencyContactSurname | AlternativePickupFirstName | AlternativePickupSurname | AlternativePickupContactNumber | BirthDate | StartDate | HasAllergy | HasDisability | CaregiverPopiaConsent | CaregiverPhotographyAndFilmingConsent | IsSouthAfricanCitizen | HasIdNumber | Gender | EthnicGroup | HomeLanguage | GrantType | PlaygroupGroup | InactiveReason | Status | Franchisee.Guid | Caregiver.FullName | Caregiver.FirstName | Caregiver.Surname | Caregiver.IdNumber | Caregiver.ContactNumber | Caregiver.RelationshipType | Caregiver.HighestEducationLevel | Caregiver.Language | Caregiver.Guid | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 697107 | 18831 | da87f56b-45a5-ec11-8351-00155d326100 | Ntobiso Amahle | Ntobiso | Amahle | 2012231507081 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2020-12-22T22:00:00Z | 2022-03-16T00:00:00 | False | False | False | False | False | True | Female | NaN | NaN | Child Grant | None | NaN | Active | 08e7b636-cd8b-e711-80e2-005056815442 | Lungisile Dhlamini | Lungisile | Dhlamini | 8504071169083 | NaN | NaN | NaN | NaN | d687f56b-45a5-ec11-8351-00155d326100 |
| 697108 | 18832 | ce14e8db-47a5-ec11-8351-00155d326100 | Kwenziwe Dlamini | Kwenziwe | Dlamini | 1903206744082 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2019-03-19T22:00:00Z | 2022-03-16T00:00:00 | False | False | False | False | False | True | Male | NaN | NaN | Child Grant | None | NaN | Active | e9861a57-7aef-e611-80d3-005056815442 | Nondumiso Dlamini | Nondumiso | Dlamini | 9311170411088 | NaN | NaN | NaN | NaN | ca14e8db-47a5-ec11-8351-00155d326100 |
| 697109 | 18833 | 3c70c606-55a5-ec11-8351-00155d326100 | Thandolwethu Sekonyela | Thandolwethu | Sekonyela | 1811146251086 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2018-11-13T22:00:00Z | 2022-03-16T00:00:00 | False | False | False | False | False | True | Male | NaN | NaN | Child Grant | None | NaN | Active | 33c90931-6409-ec11-834c-00155d326100 | Nomasonto Sekonyela | Nomasonto | Sekonyela | 9009090371089 | NaN | NaN | NaN | NaN | 3870c606-55a5-ec11-8351-00155d326100 |
| 697110 | 18834 | cfe1de2d-57a5-ec11-8351-00155d326100 | ratile lethole | ratile | lethole | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2022-03-16T00:00:00 | False | False | False | False | False | True | NaN | NaN | NaN | Child Grant | None | NaN | Active | ed304ca3-eff1-e611-80d3-005056815442 | puleng evodia lethole | puleng evodia | lethole | 8704100724086 | NaN | NaN | NaN | NaN | cbe1de2d-57a5-ec11-8351-00155d326100 |
| 697111 | 18835 | d4ad822e-bfa5-ec11-8351-00155d326100 | Skylar Horn | Skylar | Horn | 2003111425080 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2020-03-10T22:00:00Z | 2022-03-17T00:00:00 | False | False | False | False | False | True | Female | NaN | NaN | Child Grant | None | NaN | Active | d38e0456-2f42-e911-828d-0800274bb0e4 | Mary-Ann Horn | Mary-Ann | Horn | 9201180065083 | 0655907575 | Mother | NaN | NaN | e6fcf2f9-5797-eb11-8346-00155d326100 |
| 697112 | 18836 | e2d8ae07-c6a5-ec11-8351-00155d326100 | Lwandle Ntuli Ntuli | Lwandle Ntuli | Ntuli | 1711075092081 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2017-11-06T22:00:00Z | 2022-03-17T00:00:00 | False | False | False | False | False | True | Male | NaN | NaN | Child Grant | None | NaN | Active | ef759b16-c60a-ea11-8329-0800274bb0e4 | Thulisile Ntuli | Thulisile | Ntuli | 9008060420084 | NaN | NaN | NaN | NaN | ded8ae07-c6a5-ec11-8351-00155d326100 |
| 697113 | 18837 | 63d0fbf7-c6a5-ec11-8351-00155d326100 | Ntando Kearabetswe Thwala | Ntando Kearabetswe | Thwala | 1708200745081 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2017-08-19T22:00:00Z | 2022-03-17T00:00:00 | False | False | False | False | False | True | Female | NaN | NaN | Child Grant | None | NaN | Active | ef759b16-c60a-ea11-8329-0800274bb0e4 | Lydia Thwala | Lydia | Thwala | 4503290442085 | NaN | NaN | NaN | NaN | 5ad0fbf7-c6a5-ec11-8351-00155d326100 |
| 697114 | 18838 | bc2c8931-c9a5-ec11-8351-00155d326100 | Nkanyezi Zimkhitha Zamisa | Nkanyezi Zimkhitha | Zamisa | 1805190953084 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2018-05-18T22:00:00Z | 2022-03-17T00:00:00 | False | False | False | False | False | True | Female | NaN | NaN | Child Grant | None | NaN | Active | d8995a5c-3b5c-e911-82e3-0800274bb0e4 | Thembekile Zamisa | Thembekile | Zamisa | 9503090119086 | NaN | NaN | NaN | NaN | b82c8931-c9a5-ec11-8351-00155d326100 |
| 697115 | 18839 | 71b1535a-caa5-ec11-8351-00155d326100 | Pelontle Felicia Tumaeletse | Pelontle Felicia | Tumaeletse | 1812210369085 | NaN | NaN | NaN | 0797291366 | Tshepang Tumaeletse | NaN | NaN | NaN | NaN | NaN | 2018-12-20T22:00:00Z | 2022-01-24T00:00:00 | False | False | False | True | True | True | Female | African | Setswana | Child Grant | Group A | NaN | Active | 31aa94db-98fc-e911-8329-0800274bb0e4 | Tshepang Tumaeletse | Tshepang | Tumaeletse | 0004091191082 | 0762988267 | Mother | No Matric | NaN | a06bc816-caa5-ec11-8351-00155d326100 |
| 697116 | 18840 | 9e83ea9d-cca5-ec11-8351-00155d326100 | Rachel Madondo | Rachel | Madondo | 1908170000001 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2019-08-16T22:00:00Z | 2022-03-17T00:00:00 | False | False | False | False | False | True | Female | NaN | NaN | Child Grant | None | NaN | Active | d8995a5c-3b5c-e911-82e3-0800274bb0e4 | Kumbirai Mutero | Kumbirai | Mutero | 8506200000002 | NaN | NaN | NaN | NaN | 9a83ea9d-cca5-ec11-8351-00155d326100 |
Most frequently occurring
| Unnamed: 0 | Guid | FullName | FirstName | Surname | IdNumber | AllergyType | DisabilityType | EmergencyContactNumber | EmergencyContactFullName | AlternativePickupContactNumber | BirthDate | StartDate | HasAllergy | HasDisability | CaregiverPopiaConsent | CaregiverPhotographyAndFilmingConsent | IsSouthAfricanCitizen | HasIdNumber | Gender | EthnicGroup | HomeLanguage | GrantType | PlaygroupGroup | InactiveReason | Status | Franchisee.Guid | Caregiver.FullName | Caregiver.FirstName | Caregiver.Surname | Caregiver.IdNumber | Caregiver.ContactNumber | Caregiver.RelationshipType | Caregiver.HighestEducationLevel | Caregiver.Language | Caregiver.Guid | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0605e301-a345-ea11-833a-00155d326100 | Mxolisi komani | Mxolisi | komani | 0000000000012 | NaN | NaN | 0635118027 | Hans Koopman | NaN | 2017-02-16T22:00:00Z | 2020-01-17T00:00:00 | False | NaN | False | False | True | False | Male | African | isiXhosa | Child Grant | Group A | Franchisee left the programme | Active | 1e84c406-3deb-e911-8325-0800274bb0e4 | Shieda Komani | Shieda | Komani | 9511070255085 | 0635118027 | Mother | No Matric | isiXhosa | 3b69f593-3d43-ea11-8330-080027a7109a | 37 |
| 1 | 1 | 5c1e6f1a-bc45-ea11-833a-00155d326100 | Thateho Ramohlabi | Thateho | Ramohlabi | 1807095666084 | NaN | NaN | NaN | NaN | NaN | 2018-07-08T22:00:00Z | 2020-01-01T00:00:00 | False | NaN | False | False | False | True | Male | African | Setswana | Child Grant | None | Franchisee left the programme | Active | 68814817-0705-ea11-8329-0800274bb0e4 | Mpho Ramohlabi | Mpho | Ramohlabi | NaN | 0780401410 | Mother | No Matric | NaN | 581e6f1a-bc45-ea11-833a-00155d326100 | 37 |
| 2 | 2 | 5637445f-eb45-ea11-833a-00155d326100 | Shenaaze van wyk | Shenaaze | van wyk | 0000000000012 | NaN | NaN | NaN | NaN | NaN | 2016-04-10T22:00:00Z | 2020-01-17T00:00:00 | False | NaN | False | False | False | False | Female | African | Afrikaans | NaN | None | NaN | Active | 1e84c406-3deb-e911-8325-0800274bb0e4 | Ragel De Bruin | Ragel | De Bruin | 6004300574080 | 0651332965 | Mother | No Matric | Afrikaans | f32af543-b745-ea11-833a-00155d326100 | 37 |
| 3 | 3 | 4da208b6-fa45-ea11-833a-00155d326100 | Leatitia Zona | Leatitia | Zona | 0000000000012 | NaN | NaN | 0714248050 | Valencia Van Wyk | NaN | 2015-06-10T22:00:00Z | 2019-10-03T00:00:00 | False | NaN | False | False | True | False | Male | African | Afrikaans | Child Grant | Group A | Franchisee left the programme | Active | 1e84c406-3deb-e911-8325-0800274bb0e4 | Leatitia Zona | Leatitia | Zona | 8612290232085 | 0719602840 | Mother | No Matric | NaN | e99b7a2c-f945-ea11-833a-00155d326100 | 37 |
| 4 | 4 | cdb4a38c-4f46-ea11-833a-00155d326100 | Avandro Pieter Klaaste | Avandro Pieter | Klaaste | 1806226123086 | NaN | NaN | 0625698598 | Eugene Louw | NaN | 2018-10-07T22:00:00Z | 2019-10-22T00:00:00 | False | NaN | True | True | False | True | Male | Coloured | Afrikaans | Child Grant | Group B | Franchisee left the programme | Active | 1e84c406-3deb-e911-8325-0800274bb0e4 | Esmeralda Klaaste | Esmeralda | Klaaste | 9801020149086 | 0632598548 | Mother | No Matric | NaN | 1254a16f-4f46-ea11-833a-00155d326100 | 37 |
| 5 | 5 | 2b427474-5046-ea11-833a-00155d326100 | Gillasha Koopman | Gillasha | Koopman | 1810270892081 | NaN | NaN | 0769598598 | Leandre Koopman | NaN | 2018-10-26T22:00:00Z | 2019-11-18T00:00:00 | False | NaN | True | True | True | True | Female | Coloured | Afrikaans | Child Grant | Group B | Franchisee left the programme | Active | 8c168cb8-65ee-e911-8325-0800274bb0e4 | Leandre Koopman | Leandre | Koopman | 0010220038086 | 0725969896 | Mother | No Matric | NaN | 52264458-5046-ea11-833a-00155d326100 | 37 |
| 6 | 6 | 52abcbd9-5046-ea11-833a-00155d326100 | Mpaballeng Happiness Maya | Mpaballeng Happiness | Maya | 1805060468080 | NaN | NaN | NaN | NaN | NaN | 2018-05-05T22:00:00Z | 2020-01-06T00:00:00 | False | NaN | False | False | False | True | Female | African | NaN | Child Grant | None | NaN | Active | d5a7ba31-64ee-e911-8325-0800274bb0e4 | Mapaseka Maya | Mapaseka | Maya | 8803020524087 | 0735119766 | NaN | NaN | NaN | f68abdbd-5046-ea11-833a-00155d326100 | 37 |
| 7 | 7 | 6e17db16-5146-ea11-833a-00155d326100 | Prihano Davids | Prihano | Davids | 0000000000012 | NaN | NaN | 0738862330 | Filicia Dawid | NaN | 2017-03-17T22:00:00Z | 2019-11-13T00:00:00 | False | NaN | True | True | False | False | Male | Coloured | Afrikaans | Child Grant | Group B | Franchisee left the programme | Active | 8c168cb8-65ee-e911-8325-0800274bb0e4 | Filicia David | Filicia | David | 8309150139084 | 0738862330 | Mother | No Matric | NaN | d408dcf1-5046-ea11-833a-00155d326100 | 37 |
| 8 | 8 | 6aab0708-5246-ea11-833a-00155d326100 | Kim-lee Wolmarans | Kim-lee | Wolmarans | 1805280403085 | NaN | NaN | 07458996856 | Delixa | NaN | 2018-05-27T22:00:00Z | 2019-11-19T00:00:00 | False | NaN | True | True | True | True | Female | Coloured | Afrikaans | Child Grant | Group B | Franchisee left the programme | Active | 8c168cb8-65ee-e911-8325-0800274bb0e4 | Delixa Wolmarans | Delixa | Wolmarans | 8308250162087 | 0712569698 | Mother | No Matric | NaN | 92e45ce2-5146-ea11-833a-00155d326100 | 37 |
| 9 | 9 | 63745080-5246-ea11-833a-00155d326100 | Leonardo Jansen | Leonardo | Jansen | 1712036113083 | NaN | NaN | 081255889 | Juanetta | NaN | 2017-12-02T22:00:00Z | 2019-11-20T00:00:00 | False | NaN | True | True | True | True | Male | Coloured | Afrikaans | Child Grant | Group A | Franchisee left the programme | Active | 1e84c406-3deb-e911-8325-0800274bb0e4 | Juanetta Jansen | Juanetta | Jansen | 9305040247086 | 0725698856 | Mother | No Matric | NaN | 89ccd869-5246-ea11-833a-00155d326100 | 37 |